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NEISSERIA MENINGITIDIS ANTIGE NS AND COMPOS ITIONS 

This application is a continuation-in-part of the following U.S. Provisional Patent 
applications, from each of which priority is claimed, and each of which is incorporated by 
reference in its entirety: 60/083,758 (filed May 1, 1998); 60/094,869 (filed July 31, 1998); 
60/098,994 (filed September 2, 1998); 60/099,062 (filed September 2, 1998); 60/103,749 
(filed October 9, 1998); 60/103,794 (filed October 9, 1998); 60/103,796 (filed October 9, 
1998); and 60/121,528 (filed February 25, 1999). 

This invention relates to antigens from the bacterial species: Neisseria meningitidis 
and Neisseria gonorrhoeae. 

BACKGROUND 

Neisseria meningitidis is a non-motile, gram negative diplococcus human pathogen. It 
colonizes the pharynx, causing meningitis and, occasionally, septicaemia in the absence of 
meningitis. It is closely related to N. gonorrhoea, although one feature that clearly 
differentiates meningococcus from gonococcus is the presence of a polysaccharide capsule 
that is present in all pathogenic meningococci. 

N. meningitidis causes both endemic and epidemic disease. In the United States the 
attack rate is 0.6-1 per 100,000 persons per year, and it can be much greater during outbreaks, 
(see Lieberman et al (1996) Safety and Immunogenicity of a Serogroups A/C Neisseria 
meningitidis Oligosaccharide-Protein Conjugate Vaccine in Young Children. JAMA 
275(19):1499-1503; Schuchat et al (1997) Bacterial Meningitis in the United States in 1995. 
N Engl J Med 337(14):970-976). In developing countries, endemic disease rates are much 
higher and during epidemics incidence rates can reach 500 cases per 100,000 persons per 
year. Mortality is extremely high, at 10-20% in the United States, and much higher in 
developing countries. Following the introduction of the conjugate vaccine against 
Haemophilus influenzae^ N meningitidis is the major cause of bacterial meningitis at all ages 
in the United States (Schuchat et al (1997) supra). 

Based on the organism's capsular polysaccharide, 12 serogroups of AT. meningitidis 
have been identified. Group A is the pathogen most often implicated in epidemic disease in 
sub-Saharan Africa. Serogroups B and C are responsible for the vast majority of cases in the 
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United States and in most developed countries. Serogroups W135 and Y are responsible for 
the rest of the cases in the United States and developed countries. The meningococcal vaccine 
currently in use is a tetravalent polysaccharide vaccine composed of serogroups A, C, Y and 
W135. Although efficacious in adolescents and adults, it induces a poor immune response and 
short duration of protection, and cannot be used in infants [eg. Morbidity and Mortality 
weekly report, Vol.46, No. RR-5 (1997)]. This is because polysaccharides are T-cell 
independent antigens that induce a weak immune response that cannot be boosted by repeated 
immunization. Following the success of the vaccination against HAnfluenzae^ conjugate 
vaccines against serogroups A and C have been developed and are at the final stage of clinical 
testing (Zollinger WD "New and Improved Vaccines Against Meningococcal Disease". In: 
New Generation Vaccines, supra, pp. 469-488; Lieberman et al (1996) supra; Costantino et al 
(1992) Development and phase I clinical testing of a conjugate vaccine against 
meningococcus A and C. Vaccine 10:691-698). 

Meningococcus B (menB) remains a problem, however. This serotype currently is 
responsible for approximately 50% of total meningitis in the United States, Europe, and South 
America. The polysaccharide approach cannot be used because the menB capsular 
polysaccharide is a polymer of a(2-8)-linked TV-acetyl neuraminic acid that is also present in 
mammalian tissue. This results in tolerance to the antigen; indeed, if an immune response 
were elicited, it would be anti-self, and therefore undesirable. In order to avoid induction of 
autoimmunity and to induce a protective immune response, the capsular polysaccharide has, 
for instance, been chemically modified substituting the 7V-acetyl groups with Af-propionyl 
groups, leaving the specific antigenicity unaltered (Romero & Outschoorn (1994) Current 
status of Meningococcal group B vaccine candidates: capsular or non-capsular? Clin 
Microbiol Rev 7(4):559-575). 

Alternative approaches to menB vaccines have used complex mixtures of outer 
membrane proteins (OMPs), containing either the OMPs alone, or OMPs enriched in porins, 
or deleted of the class 4 OMPs that are believed to induce antibodies that block bactericidal 
activity. This approach produces vaccines that are not well characterized. They are able to 
protect against the homologous strain, but are not effective at large where there are many 
antigenic variants of the outer membrane proteins. To overcome the antigenic variability, 
multivalent vaccines containing up to nine different porins have been constructed {eg. 
Poolman JT (1992) Development of a meningococcal vaccine. Infect. Agents Dis. 4:13-28). 
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Additional proteins to be used in outer membrane vaccines have been the opa and opc 
proteins, but none of these approaches have been able to overcome the antigenic variability 
(eg. Ala'Aldeen & Borriello (1996) The meningococcal transferrin-binding proteins 1 and 2 
are both surface exposed and generate bactericidal antibodies capable of killing homologous 
and heterologous strains. Vaccine 14(l):49-53). 

A certain amount of sequence data is available for meningococcal and gonoccocal 
genes and proteins (e.g. EP-A-0467714, W096/29412), but this is by no means complete. 
Other men B proteins may include those listed in WO 97/28273, WO 96/29412, WO 
95/03413, US 5,439,808, and US 5,879,686. 

The provision of further sequences could provide an opportunity to identify secreted 
or surface-exposed proteins that are presumed targets for the immune system and which are 
not antigenically variable. For instance, some of the identified proteins could be components 
of efficacious vaccines against meningococcus B, some could be components of vaccines 
against all meningococcal serotypes, and others could be components of vaccines against all 
pathogenic Neisseriae including Neisseria meningitidis or Neisseria gonorrhoeae. Those 
sequences specific to N. meningitidis or N. gonorrhoeae that are more highly conserved are 
further preferred sequences. 

It is thus an object of the invention is to provide Neisserial DNA sequences which 
encode proteins that are antigenic or immunogenic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates the products of protein expression and purification of the predicted 
ORF 919 as cloned and expressed in E. coli. 

Fig. 2 illustrates the products of protein expression and purification of the predicted 
ORF 279 as cloned and expressed in E. coli. 

Fig. 3 illustrates the products of protein expression and purification of the predicted 
ORF 576-1 as cloned and expressed in E. coli. 

Fig. 4 illustrates the products of protein expression and purification of the predicted 
ORF 519-1 as cloned and expressed in E. coli. 

Fig. 5 illustrates the products of protein expression and purification of the predicted 
ORF 121-1 as cloned and expressed in E. coli. 
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Fig. 6 illustrates the products of protein expression and purification of the predicted 
ORF 128-1 as cloned and expressed in£. coli. 

Fig. 7 illustrates the products of protein expression and purification of the predicted 
ORF 206 as cloned and expressed in E. coli. 

Fig. 8 illustrates the products of protein expression and purification of the predicted 
ORF 287 as cloned and expressed in E. coli. 

Fig. 9 illustrates the products of protein expression and purification of the predicted 
ORF 406 as cloned and expressed in E. coli. 

Fig. 10 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 919 as cloned and expressed in E. coli. 

Fig. 1 1 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 279 as cloned and expressed in£. coli. 

Fig. 12 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 576-1 as cloned and expressed in E. coli. 

Fig. 13 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 519-1 as cloned and expressed in E. coli. 

Fig. 14 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 121-1 as cloned and expressed in E. coli. 

Fig. 15 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 128-1 as cloned and expressed in E. coli. 

Fig. 16 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 206 as cloned and expressed in E. coli. 

Fig. . 17 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 287 as cloned and expressed in E. coli. 

Fig. 18 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 406 as cloned and expressed in E. coli. 

Fig. 19 shows an alignment comparison of amino acid sequences for ORF 225 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 
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Fig. 20 shows an alignment comparison of amino acid sequences for ORF 235 for 
several strains of Neisseria, Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 21 shows an alignment comparison of amino acid sequences for ORF 287 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 22 shows an alignment comparison of amino acid sequences for ORF 519 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 23 shows an alignment comparison of amino acid sequences for ORF 919 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

THE INVENTION 

The invention provides proteins comprising the N. meningitidis amino acid sequences 
and N. gonorrhoeae amino acid sequences disclosed in the examples. 

It also provides proteins comprising sequences homologous (i.e., those having 
sequence identity) to the N. meningitidis amino acid sequences disclosed in the examples. 
Depending on the particular sequence, the degree of homology (sequence identity) is 
preferably greater than 50% (eg. 60%, 70%, 80%, 90%, 95%, 99% or more). These proteins 
include mutants and allelic variants of the sequences disclosed in the examples. Typically, 
50% identity or more between two proteins is considered to be an indication of functional 
equivalence. Identity between proteins is preferably determined by the Smith-Waterman 
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homology search algorithm as implemented in MPSRCH program (Oxford Molecular) using 
an affine gap search with parameters:gap penalty 12, gap extension penalty 1. 

The invention further provides proteins comprising fragments of the TV. meningitidis 
amino acid sequences and N. gonorrhoeae amino acid sequences disclosed in the examples. 
The fragments should comprise at least n consecutive amino acids from the sequences and, 
depending on the particular sequence, n is 7 or more {eg. 8, 10, 12, 14, 16, 18, 20 or more). 
Preferably the fragments comprise an epitope from the sequence. 

The proteins of the invention can, of course, be prepared by various means {eg. 
recombinant expression, purification from cell culture, chemical synthesis etc.) and in various 
forms {eg, native, fusions etc.). They are preferably prepared in substantially pure or isolated 
form {ie. substantially free from other TV. meningitidis or N. gonorrhoeae host cell proteins) 

According to a further aspect, the invention provides antibodies which bind to these 
proteins. These may be polyclonal or monoclonal and may be produced by any suitable 
means. 

According to a further aspect, the invention provides nucleic acid comprising the 
N. meningitidis nucleotide sequences and N. gonorrhoeae nucleotide sequences disclosed in 
the examples. 

According to a further aspect, the invention comprises nucleic acids having sequence 
identity of greater than 50% (e.g., 60%, 70%, 80%, 90%, 95%, 99% or more) to the nucleic 
acid sequences herein. Sequence identity is determined as above-discussed. 

According to a further aspect, the invention comprises nucleic acid that hybridizes to 
the sequences provided herein. Conditions for hybridization are set forth herein. 

.Nucleic, acid comprising fragments of these sequences are also provided. These should 
comprise at least n consecutive nucleotides from the N. meningitidis sequences or N. 
gonorrhoeae sequences and depending on the particular sequence, n is 10 or more {eg 12, 14, 
15, 18, 20, 25, 30, 35, 40 or more). 

According to a further aspect, the invention provides nucleic acid encoding the 
proteins and protein fragments of the invention. 

It should also be appreciated that the invention provides nucleic acid comprising 
sequences complementary to those described above {eg. for antisense or probing purposes). 

Nucleic acid according to the invention can, of course, be prepared in many ways {eg. 
by chemical synthesis, in part or in whole, from genomic or cDNA libraries, from the 
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organism itself etc.) and can take various forms (eg. single stranded, double stranded, vectors, 
probes etc.). 

In addition, the term "nucleic acid" includes DNA and RNA, and also their analogues, 
such as those containing modified backbones, and also protein nucleic acids (PNA) etc. 

According to a further aspect, the invention provides vectors comprising nucleotide 
sequences of the invention (eg. expression vectors) and host cells transformed with such 
vectors. 

According to a further aspect, the invention provides compositions comprising protein, 
antibody, and/or nucleic acid according to the invention. These compositions may be suitable 
as vaccines, for instance, or as diagnostic reagents or as immunogenic compositions. 

The invention also provides nucleic acid, protein, or antibody according to the 
invention for use as medicaments (eg. as vaccines) or as diagnostic reagents. It also provides 
the use of nucleic acid, protein, or antibody according to the invention in the manufacture of 
(I) a medicament for treating or preventing infection due to Neisserial bacteria (ii) a 
diagnostic reagent for detecting the presence of Neisserial bacteria or of antibodies raised 
against Neisserial bacteria or (iii) for raising antibodies. Said Neisserial bacteria may be any 
species or strain (such as N. gonorrhoeae) but are preferably N. meningitidis, especially strain 
B or strain C. 

The invention also provides a method of treating a patient, comprising administering 
to the patient a therapeutically effective amount of nucleic acid, protein, and/or antibody 
according to the invention. 

According to further aspects, the invention provides various processes. 

A process for producing proteins of the invention is provided, comprising the step of 
culturing a host cell according to the invention under conditions which induce protein 
expression. 

A process for detecting polynucleotides of the invention is provided, comprising the 
steps of: (a) contacting a nucleic probe according to the invention with a biological sample 
under hybridizing conditions to form duplexes; and (b) detecting said duplexes. 

A process for detecting proteins of the invention is provided, comprising the steps of: 
(a) contacting an antibody according to the invention with a biological sample under 
conditions suitable for the formation of an antibody-antigen complexes; and (b) detecting 
said complexes. 
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Having now generally described the invention, the same will be more readily 
understood through reference to the following examples which are provided by way of 
illustration, and are not intended to be limiting of the present invention, unless specified. 

Methodology - Summary of standard procedures and techniques. 
General 

This invention provides Neisseria meningitidis menB nucleotide sequences, amino 
acid sequences encoded therein. With these disclosed sequences, nucleic acid probe assays 
and expression cassettes and vectors can be produced. The expression vectors can be 
transformed into host cells to produce proteins. The purified or isolated polypeptides (which 
may also be chemically synthesized) can be used to produce antibodies to detect menB 
proteins. Also, the host cells or extracts can be utilized for biological assays to isolate 
agonists or antagonists. In addition, with these sequences one can search to identify open 
reading frames and identify amino acid sequences. The proteins may also be used in 
immunogenic compositions, antigenic compositions and as vaccine components. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA, and 
immunology, which are within the skill of the art. Such techniques are explained fully in the 
literature e.g., Sambrook Molecular Cloning; A Laboratory Manual, Second Edition (1989); 
DNA Cloning, Volumes I and ii (D.N Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait 
ed, 1984); Nucleic Acid Hybridization (BJD. Hames & S.J. Higgins eds. 1984); Transcription 
and Translation (B.D. Hames & S J. Higgins eds. 1984); Animal Cell Culture (R.I. Freshney 
ed. 1986); Immobilized Cells and Enzymes (IRL Press, 1986); B. Perbal, v4 Practical Guide to 
Molecular Cloning (1984); the Methods in Enzymology series (Academic Press, Inc.), 
especially volumes 154 & 155; Gene Transfer Vectors for Mammalian Cells (J.H. Miller and 
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); Mayer and Walker, eds. (1987), 
Immunochemical Methods in Cell and Molecular Biology (Academic Press, London); Scopes, 
(1987) Protein Purification: Principles and Practice, Second Edition (Springer- Verlag, 
N.Y.), and Handbook of Experimental Immunology, Volumes I-IV (D.M. Weir and C.C. 
Blackwell eds 1986). 

Standard abbreviations for nucleotides and amino acids are used in this specification. 
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All publications, patents, and patent applications cited herein are incorporated in full 
by reference. 

Expression systems 

The Neisseria menB nucleotide sequences can be expressed in a variety of different 
expression systems; for example those used with mammalian cells, plant cells, baculoviruses, 
bacteria, and yeast. 

i. Mammalian Systems 

Mammalian expression systems are known in the art. A mammalian promoter is any 
DNA sequence capable of binding mammalian RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence (e.g., structural gene) into mRNA. A 
promoter will have a transcription initiating region, which is usually placed proximal to the 5' 
end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream 
of the transcription initiation site. The TATA box is thought to direct RNA polymerase II to 
begin RNA synthesis at the correct site. A mammalian promoter will also contain an 
upstream promoter element, usually located within 100 to 200 bp upstream of the TATA box. 
An upstream promoter element determines the rate at which transcription is initiated and can 
act in either orientation (Sambrook et al. (1989) "Expression of Cloned Genes in Mammalian 
Cells." In Molecular Cloning: A Laboratory Manual, 2nd ed.). 

Mammalian viral genes are often highly expressed and have a broad host range; 
therefore sequences encoding mammalian viral genes provide particularly useful promoter 
sequences. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus promoter. In 
addition, sequences derived from non-viral genes, such as the murine metallothionein gene, 
also provide useful promoter sequences. Expression may be either constitutive or regulated 
(inducible). Depending on the promoter selected, many promotes may be inducible using 
known substrates, such as the use of the mouse mammary tumor virus (MMTV) promoter 
with the glucocorticoid responsive element (GRE) that is induced by glucocorticoid in 
hormone-responsive transformed cells (see for example, U.S. Patent 5,783,681). 

The presence of an enhancer element (enhancer), combined with the promoter 
elements described above, will usually increase expression levels. An enhancer is a 
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regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to 
homologous or heterologous promoters, with synthesis beginning at the normal RNA start 
site. Enhancers are also active when they are placed upstream or downstream from the 
transcription initiation site, in either normal or flipped orientation, or at a distance of more 
than 1000 nucleotides from the promoter (Maniatis et al. (1987) Science 236:1237; Alberts et 
al. (1989) Molecular Biology of the Cell, 2nd ed.). Enhancer elements derived from viruses 
may be particularly useful, because they usually have a broader host range. Examples include 
the SV40 early gene enhancer (Dijkema et al (1985) EMBO J. 4:761) and the 
enhancer/promoters derived from the long terminal repeat (LTR) of the Rous Sarcoma Virus 
(Gorman et al. (1982b) Proc. Natl Acad. Sci. 79:6777) and from human cytomegalovirus 
(Boshart et aL (1985) Cell 47:521). Additionally, some enhancers are regulatable and become 
active only in the presence of an inducer, such as a hormone or metal ion (Sassone-Corsi and 
Borelli (1986) Trends Genet. 2:215; Maniatis et al. (1987) Science 236:1237). 

A DNA molecule may be expressed intracellular^ in mammalian cells. A promoter 
sequence may be directly linked with the DNA molecule, in which case the first amino acid at 
the N-terminus of the recombinant protein will always be a methionine, which is encoded by 
the ATG start codon. If desired, the N-terminus may be cleaved from the protein by in vitro 
incubation with cyanogen bromide. 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion of the foreign protein in mammalian cells. 
Preferably, there are processing sites encoded between the leader fragment and the foreign 
gene that can.be cleaved either in vivo or in vitro. The leader sequence fragment usually 
encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of 
the protein from the cell. The adenovirus tripartite leader is an example of a leader sequence 
that provides for secretion of a foreign protein in mammalian cells. 

Usually, transcription termination and polyadenylation sequences recognized by 
mammalian cells are regulatory regions located 3' to the translation stop codon and thus, 
together with the promoter elements, flank the coding sequence. The 3' terminus of the 
mature mRNA is formed by site-specific post-transcriptional cleavage and polyadenylation 
(Birnstiel et al. (1985) Cell 41:349; Proudfoot and Whitelaw (1988) "Termination and 3' end 
processing of eukaryotic RNA. In Transcription and splicing (ed. B.D. Hames and D.M. 
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Glover); Proudfoot (1989) Trends Biochem. ScL 14:105). These sequences direct the 
transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. 
Examples of transcription terminator/polyadenylation signals include those derived from 
SV40 (Sambrook et al (1989) "Expression of cloned genes in cultured mammalian cells." In 
Molecular Cloning: A Laboratory Manual). 

Usually, the above described components, comprising a promoter, polyadenylation 
signal, and transcription termination sequence are put together into expression constructs. 
Enhancers, introns with functional splice donor and acceptor sites, and leader sequences may 
also be included in an expression construct, if desired. Expression constructs are often 
maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of 
stable maintenance in a host, such as mammalian cells or bacteria. Mammalian replication 
systems include those derived from animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the replication systems of papovaviruses, such as 
SV40 (Gluzman (1981) Cell 23:175) or polyomavirus, replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. Additional examples of 
mammalian replicons include those derived from bovine papillomavirus and Epstein-Barr 
virus. Additionally, the replicon may have two replication systems, thus allowing it to be 
maintained, for example, in mammalian cells for expression and in a prokaryotic host for 
cloning and amplification. Examples of such mammalian-bacteria shuttle vectors include 
pMT2 (Kaufman et al. (1989) Mol Cell Biol 9:946) and pHEBO (Shimizu et al. (1986) Mol 
Cell Biol 6:1074). 

The transformation procedure used depends upon the host to be transformed. Methods 
for introduction of heterologous polynucleotides into mammalian cells are known in the art 
and include dextran-mediated transfection, calcium phosphate precipitation, polybrene 
mediated transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

Mammalian cell lines available as hosts for expression are known in the art and 
include many immortalized cell lines available from the American Type Culture Collection 
(ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby 
hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma 
cells (e.g., Hep G2), and a number of other cell lines. 
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ii. Plant Cellular Expression Systems 

There are many plant cell culture and whole plant genetic expression systems known 
in the art. Exemplary plant cellular genetic expression systems include those described in 
patents, such as: U.S. 5,693,506; US 5,659,122; and US 5,608,143. Additional examples of 
genetic expression in plant cell culture has been described by Zenk, Phytochemistry 30:3861- 
3863 (1991). Descriptions of plant protein signal peptides may be found in addition to the 
references described above in Vaulcombe et al., Mol Gen. Genet. 209:33-40 (1987); Chandler 
et al., Plant Molecular Biology 3:407-418 (1984); Rogers, J. Biol Chem. 260:3731-3738 
(1985); Rothstein et al., Gene 55:353-356 (1987); Whittier et al., Nucleic Acids Research 
15:2515-2535 (1987); Wirsel et al., Molecular Microbiology 3:3-14 (1989); Yu et al., Gene 
122:247-253 (1992). A description of the regulation of plant gene expression by the 
phytohormone, gibberellic acid and secreted enzymes induced by gibberellic acid can be 
found in R.L. Jones and J. MacMillin, Gibberellins: in: Advanced Plant Physiology,. Malcolm 
B. Wilkins, ed., 1984 Pitman Publishing Limited, London, pp. 21-52. References that 
describe other metabolically-regulated genes: Sheen, Plant Cell, 2:1027-1038(1990); Maas et 
al., EMBOJ. 9:3447-3452 (1990); Benkel and Hickey, Proc. Natl Acad. Set 84:1337-1339 
(1987) 

Typically, using techniques known in the art, a desired polynucleotide sequence is 
inserted into an expression cassette comprising genetic regulatory elements designed for 
operation in plants. The expression cassette is inserted into a desired expression vector with 
companion sequences upstream and downstream from the expression cassette suitable for 
expression in a plant host. The companion sequences will be of plasmid or viral origin and 
provide necessary characteristics to the vector to permit the vectors to move DNA from an 
original cloning host, such as bacteria, to the desired plant host. The basic bacterial/plant 
vector construct will preferably provide a broad host range prokaryote replication origin; a 
prokaryote selectable marker; and, for Agrobacterium transformations, T DNA sequences for 
Agrobacterium-mediated transfer to plant chromosomes. Where the heterologous gene is not 
readily amenable to detection, the construct will preferably also have a selectable marker gene 
suitable for determining if a plant cell has been transformed. A general review of suitable 
markers, for example for the members of the grass family, is found in Wilmink and Dons, 
1993, Plant Mol. Biol Reptr, 1 1(2): 1 65-1 85. 
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Sequences suitable for permitting integration of the heterologous sequence into the 
plant genome are also recommended. These might include transposon sequences and the like 
for homologous recombination as well as Ti sequences which permit random insertion of a 
heterologous expression cassette into a plant genome. Suitable prokaryote selectable markers 
include resistance toward antibiotics such as ampicillin or tetracycline. Other DNA sequences 
encoding additional functions may also be present in the vector, as is known in the art. 

The nucleic acid molecules of the subject invention may be included into an 
expression cassette for expression of the protein(s) of interest. Usually, there will be only one 
expression cassette, although two or more are feasible. The recombinant expression cassette 
will contain in addition to the heterologous protein encoding sequence the following 
elements, a promoter region, plant 5' untranslated sequences, initiation codon depending upon 
whether or not the structural gene comes equipped with one, and a transcription and 
translation termination sequence. Unique restriction enzyme sites at the 5 ! and 3 1 ends of the 
cassette allow for easy insertion into a pre-existing vector. 

A heterologous coding sequence may be for any protein relating to the present 
invention. The sequence encoding the protein of interest will encode a signal peptide which 
allows processing and translocation of the protein, as appropriate, and will usually lack any 
sequence which might result in the binding of the desired protein of the invention to a 
membrane. Since, for the most part, the transcriptional initiation region will be for a gene 
which is expressed and translocated during germination, by employing the signal peptide 
which provides for translocation, one may also provide for translocation of the protein of 
interest. In this way, the protein(s) of interest will be translocated from the cells in which they 
are expressed and may be efficiently harvested. Typically secretion in seeds are across the 
aleurone or scutellar epithelium layer into the endosperm of the seed. While it is not required 
that the protein be secreted from the cells in which the protein is produced, this facilitates the 
isolation and purification of the recombinant protein. 

Since the ultimate expression of the desired gene product will be in a eucaryotic cell it 
is desirable to determine whether any portion of the cloned gene contains sequences which 
will be processed out as introns by the host's splicosome machinery. If so, site-directed 
mutagenesis of the "intron" region may be conducted to prevent losing a portion of the genetic 
message as a false intron code, Reed and Maniatis, Cell 41:95-105, 1985. 
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The vector can be microinjected directly into plant cells by use of micropipettes to 
mechanically transfer the recombinant DNA. Crossway, Mol. Gen, Genet, 202:179-185, 
1985. The genetic material may also be transferred into the plant cell by using polyethylene 
glycol, Krens, et a!., Nature, 296, 72-74, 1982. Another method of introduction of nucleic 
acid segments is high velocity ballistic penetration by small particles with the nucleic acid 
either within the matrix of small beads or particles, or on the surface, Klein, et al., Nature, 
327, 70-73, 1987 and Knudsen and Muller, 1991, Planta, 185:330-336 teaching particle 
bombardment of barley endosperm to create transgenic barley. Yet another method of 
introduction would be fusion of protoplasts with other entities, either minicells, cells, 
lysosomes or other fusible lipid-surfaced bodies, Fraley, et al., Proa Natl. Acad. Sci. USA, 79, 
1859-1863, 1982. 

The vector may also be introduced into the plant cells by electroporation. (Fromm et 
al., Proa Natl Acad. Sci. USA 82:5824, 1985). In this technique, plant protoplasts are 
electroporated in the presence of plasmids containing the gene construct. Electrical impulses 
of high field strength reversibly permeabilize biomembranes allowing the introduction of the 
plasmids. Electroporated plant protoplasts reform the cell wall, divide, and form plant callus. 

All plants from which protoplasts can be isolated and cultured to give whole 
regenerated plants can be transformed by the present invention so that whole plants are 
recovered which contain the transferred gene. It is known that practically all plants can be 
regenerated from cultured cells or tissues, including but not limited to all major species of 
sugarcane, sugar beet, cotton, fruit and other trees, legumes and vegetables. Some suitable 
plants include, for example, species from the genera Fragaria, Lotus, Medicago, Onobrychis, 
Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, 
Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersion, 
Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Hererocallis, Nemesia, Pelargonium, Panicum, Pennisetum, 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Lolium, Zea, Triticum, 
Sorghum, and Datura. 

Means for regeneration vary from species to species of plants, but generally a 
suspension of transformed protoplasts containing copies of the heterologous gene is first 
provided. Callus tissue is formed and shoots may be induced from callus and subsequently 
rooted. Alternatively, embryo formation can be induced from the protoplast suspension. 
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These embryos germinate as natural embryos to form plants. The culture media will generally 
contain various amino acids and hormones, such as auxin and cytokinins. It is also 
advantageous to add glutamic acid and proline to the medium, especially for such species as 
com and alfalfa. Shoots and roots normally develop simultaneously. Efficient regeneration 
will depend on the medium, on the genotype, and on the history of the culture. If these three 
variables are controlled, then regeneration is fully reproducible and repeatable. 

In some plant cell culture systems, the desired protein of the invention may be 
excreted or alternatively, the protein may be extracted from the whole plant. Where the 
desired protein of the invention is secreted into the medium, it may be collected. 
Alternatively, the embryos and embryoless-half seeds or other plant tissue may be 
mechanically disrupted to release any secreted protein between cells and tissues. The mixture 
may be suspended in a buffer solution to retrieve soluble proteins. Conventional protein 
isolation and purification methods will be then used to purify the recombinant protein. 
Parameters of time, temperature pH, oxygen, and volumes will be adjusted through routine 
methods to optimize expression and recovery of heterologous protein. 

i 

iii. Baculovirus Systems 

The polynucleotide encoding the protein can also be inserted into a suitable insect 
expression vector, and is operably linked to the control elements within that vector. Vector 
construction employs techniques which are known in the art. Generally, the components of 
the expression system include a transfer vector, usually a bacterial plasmid, which contains 
both a fragment of the baculovirus genome, and a convenient restriction site for insertion of 
the heterologous gene or genes to be expressed; a wild type baculovirus with a sequence 
homologous to the baculovirus-specific fragment in the transfer vector (this allows for the 
homologous recombination of the heterologous gene in to the baculovirus genome); and 
appropriate insect host cells and growth media. 

After inserting the DNA sequence encoding the protein into the transfer vector, the 
vector and the wild type viral genome are transfected into an insect host cell where the vector 
and viral genome are allowed to recombine. The packaged recombinant virus is expressed 
and recombinant plaques are identified and purified. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, inter 
alia, Invitrogen, San Diego CA ("MaxBac" kit). These techniques are generally known to 
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those skilled in the art and fully described in Summers and Smith, Texas Agricultural 
Experiment Station Bulletin No. J555 (1987) (hereinafter "Summers and Smith 1 '). 

Prior to inserting the DNA sequence encoding the protein into the baculovirus 
genome, the above described components, comprising a promoter, leader (if desired), coding 
sequence of interest, and transcription termination sequence, are usually assembled into an 
intermediate transplacement construct (transfer vector). This construct may contain a single 
gene and operably linked regulatory elements; multiple genes, each with its owned set of 
operably linked regulatory elements; or multiple genes, regulated by the same set of 
regulatory elements. Intermediate transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance 
in a host, such as a bacterium. The replicon will have a replication system, thus allowing it to 
be maintained in a suitable host for cloning and amplification. 

Currently, the most commonly used transfer vector for introducing foreign genes into 
AcNPV is pAc373. Many other vectors, known to those of skill in the art, have also been 
designed. These include, for example, pVL985 (which alters the polyhedrin start codon from 
ATG to ATT, and which introduces a BamHI cloning site 32 basepairs downstream from the 
ATT; see Luckow and Summers, Virology (1989) 1 7:31. 

The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et al. 
(1988) Ann, Rev. Microbiol, 42:111) and a prokaryotic ampicillin-resistance {amp) gene and 
origin of replication for selection and propagation in E. coli. 

Baculovirus transfer vectors usually contain a baculovirus promoter. A baculovirus 
promoter is any DNA sequence capable of binding a baculovirus RNA polymerase and 
initiating the downstream (5' to 3') transcription of a coding sequence (e.g., structural gene) 
into mRNA. A promoter will have a transcription initiation region which is usually placed 
proximal to the 5 1 end of the coding sequence. This transcription initiation region usually 
includes an RNA polymerase binding site and a transcription initiation site. A baculovirus 
transfer vector may also have a second domain called an enhancer, which, if present, is 
usually distal to the structural gene. Expression may be either regulated or constitutive. 

Structural genes, abundantly transcribed at late times in a viral infection cycle, provide 
particularly useful promoter sequences. Examples include sequences derived from the gene 
encoding the viral polyhedron protein, Friesen et al., (1986) "The Regulation of Baculovirus 
Gene Expression," in: The Molecular Biology of Baculoviruses (ed. Walter Doerfler); EPO 
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Publ. Nos. 127 839 and 155 476; and the gene encoding the plO protein, Vlak et al., (1988), J. 
Gen. Virol. 59:765. 

DNA encoding suitable signal sequences can be derived from genes for secreted insect 
or baculovirus proteins, such as the baculovirus polyhedrin gene (Carbonell et al. (1988) 
Gene, 73:409). Alternatively, since the signals for mammalian cell posttranslational 
modifications (such as signal peptide cleavage, proteolytic cleavage, and phosphorylation) 
appear to be recognized by insect cells, and the signals required for secretion and nuclear 
accumulation also appear to be conserved between the invertebrate cells and vertebrate cells, 
leaders of non-insect origin, such as those derived from genes encoding human (alpha) a- 
interferon, Maeda et al., (1985), Nature 315:592; human gastrin-releasing peptide, Lebacq- 
Verheyden et al., (1988), Molec. Cell Biol 5:3129; human IL-2, Smith et al., (1985) Proc. 
Nat'lAcad. Scl USA, £2:8404; mouse IL-3, (Miyajima et al., (1987) Gene 58:273; and human 
glucocerebrosidase, Martin et al. (1988) DNA, 7:99, can also be used to provide for secretion 
in insects. 

A recombinant polypeptide or polyprotein may be expressed intracellularly or, if it is 
expressed with the proper regulatory sequences, it can be secreted. Good intracellular 
expression of nonfused foreign proteins usually requires heterologous genes that ideally have 
a short leader sequence containing suitable translation initiation signals preceding an ATG 
start signal. If desired, methionine at the N-terminus may be cleaved from the mature protein 
by in vitro incubation with cyanogen bromide. 

Alternatively, recombinant polyproteins or proteins which are not naturally secreted 
can be secreted from the insect cell by creating chimeric DNA molecules that encode a fusion 
protein comprised of a leader sequence fragment that provides for secretion of the foreign 
protein in insects. The leader sequence fragment usually encodes a signal peptide comprised 
of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic 
reticulum. 

After insertion of the DNA sequence and/or the gene encoding the expression product 
precursor of the protein, an insect cell host is co-transformed with the heterologous DNA of 
the transfer vector and the genomic DNA of wild type baculovirus — usually by co- 
transfection. The promoter and transcription termination sequence of the construct will 
usually comprise a 2-5kb section of the baculovirus genome. Methods for introducing 
heterologous DNA into the desired site in the baculovirus virus are known in the art. (See 
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Summers and Smith supra; Ju et al. (1987); Smith et al., Mol Cell Biol. (1983) 5:2156; and 
Luckow and Summers (1989)). For example, the insertion can be into a gene such as the 
polyhedrin gene, by homologous double crossover recombination; insertion can also be into a 
restriction enzyme site engineered into the desired baculovirus gene. Miller et ah, (1989), 
Bioessays 4:9\. The DNA sequence, when cloned in place of the polyhedrin gene in the 
expression vector, is flanked both 5' and 3 f by polyhedrin-specific sequences and is positioned 
downstream of the polyhedrin promoter. 

The newly formed baculovirus expression vector is subsequently packaged into an 
infectious recombinant baculovirus. Homologous recombination occurs at low frequency 
(between about 1% and about 5%); thus, the majority of the virus produced after 
cotransfection is still wild-type virus. Therefore, a method is necessary to identify 
recombinant viruses. An advantage of the expression system is a visual screen allowing 
recombinant viruses to be distinguished. The polyhedrin protein, which is produced by the 
native virus, is produced at very high levels in the nuclei of infected cells at late times after 
viral infection. Accumulated polyhedrin protein forms occlusion bodies that also contain 
embedded particles. These occlusion bodies, up to 15 jxm in size, are highly refractile, giving 
them a bright shiny appearance that is readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies. To distinguish recombinant virus 
from wild-type virus, the transfection supernatant is plaqued onto a monolayer of insect cells 
by techniques known to those skilled in the art. Namely, the plaques are screened under the 
light microscope for the presence (indicative of wild-type virus) or absence (indicative of 
recombinant virus) of occlusion bodies. Current Protocols in Microbiology Vol. 2 (Ausubel 
et al. eds) at 16.8 (Supp. 10, 1990);. Summers- and Smith, supra; Miller et al. (1989). 

Recombinant baculovirus expression vectors have been developed for infection into 
several insect cells. For example, recombinant baculoviruses have been developed for, inter 
alia: Aedes aegypti , Autographa californica, Bombyx mori, Drosophila melanogaster, 
Spodoptera frugiperda, and Trichoplusia ni (PCT Pub. No. WO 89/046699; Carbonell et al., 
(1985) y. Virol 56:153; Wright (1986) Nature 527:718; Smith et al., (1983) Mol Cell Biol 
5:2156; and see generally, Fraser, et al (1989) In Vitro Cell Dev. Biol 25:225). 

Cells and cell culture media are commercially available for both direct and fusion 
expression of heterologous polypeptides in a baculovirus/expression system; cell culture 
technology is generally known to those skilled in the art. See, e.g., Summers and Smith supra. 
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The modified insect cells may then be grown in an appropriate nutrient medium, 
which allows for stable maintenance of the plasmid(s) present in the modified insect host. 
Where the expression product gene is under inducible control, the host may be grown to high 
density, and expression induced. Alternatively, where expression is constitutive, the product 
will be continuously expressed into the medium and the nutrient medium must be 
continuously circulated, while removing the product of interest and augmenting depleted 
nutrients. The product may be purified by such techniques as chromatography, e.g., HPLC, 
affinity chromatography, ion exchange chromatography, etc.; electrophoresis; density gradient 
centrifugation; solvent extraction, or the like. As appropriate, the product may be further 
purified, as required, so as to remove substantially any insect proteins which are also secreted 
in the medium or result from lysis of insect cells, so as to provide a product which is at least 
substantially free of host debris, e.g., proteins, lipids and polysaccharides. 

In order to obtain protein expression, recombinant host cells derived from the 
transformants are incubated under conditions which allow expression of the recombinant 
protein encoding sequence. These conditions will vary, dependent upon the host cell selected. 
However, the conditions are readily ascertainable to those of ordinary skill in the art, based 
upon what is known in the art. 

iv. Bacterial Systems 

Bacterial expression techniques are known in the art. A bacterial promoter is any DNA 
sequence capable of binding bacterial RNA polymerase and initiating the downstream (3') 
transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a 
transcription initiation region which is usually placed proximal to the 5 f end of the coding 
sequence. This transcription initiation region usually includes an RNA polymerase binding 
site and a transcription initiation site. A bacterial promoter may also have a second domain 
called an operator, that may overlap an adjacent RNA polymerase binding site at which RNA 
synthesis begins. The operator permits negative regulated (inducible) transcription, as a gene 
repressor protein may bind the operator and thereby inhibit transcription of a specific gene. 
Constitutive expression may occur in the absence of negative regulatory elements, such as the 
operator. In addition, positive regulation may be achieved by a gene activator protein binding 
sequence, which, if present is usually proximal (5*) to the RNA polymerase binding sequence. 
An example of a gene activator protein is the catabolite activator protein (CAP), which helps 
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initiate transcription of the lac operon in Escherichia coli {E. coli) (Raibaud et aL (1984) 
Annu. Rev. Genet. 75:173). Regulated expression may therefore be either positive or negative, 
thereby either enhancing or reducing transcription. 

Sequences encoding metabolic pathway enzymes provide particularly useful promoter 
sequences. Examples include promoter sequences derived from sugar metabolizing enzymes, 
such as galactose, lactose {lac) (Chang et aL (1977) Nature 198:1056), and maltose. 
Additional examples include promoter sequences derived from biosynthetic enzymes such as 
tryptophan {trp) (Goeddel et aL (1980) Nuc. Acids Res. 5:4057; Yelverton et aL (1981) NucL 
Acids Res. 9:731; U.S. Patent 4,738,921; EPO Publ. Nos. 036 776 and 121 775). The beta- 
lactamase {bid) promoter system (Weissmann (1981) "The cloning of interferon and other 
mistakes." In Interferon 3 (ed. I. Gresser)), bacteriophage lambda PL (Shimatake et aL (1981) 
Nature 292:12%) and T5 (U.S. Patent 4,689,406) promoter systems also provide useful 
promoter sequences. 

In addition, synthetic promoters which do not occur in nature also function as bacterial 
promoters. For example, transcription activation sequences of one bacterial or bacteriophage 
promoter may be joined with the operon sequences of another bacterial or bacteriophage 
promoter, creating a synthetic hybrid promoter (U.S. Patent 4,551,433). For example, the tac 
promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac operon 
sequences that is regulated by the lac repressor (Amann et al. (1983) Gene 25:167; de Boer et 
aL (1983) Proc. NatL Acad. Sci. 80:21). Furthermore, a bacterial promoter can include 
naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial 
RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial 
origin can also be coupled with a compatible RNA polymerase to produce high levels of 
expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter 
system is an example of a coupled promoter system (Studier et aL (1986) J. Mol. Biol. 
189:113; Tabor et aL (1985) Proc NatL Acad. ScL 82:1074), In addition, a hybrid promoter 
can also be comprised of a bacteriophage promoter and an E. coli operator region (EPO Publ. 
No. 267 851). 

In addition to a functioning promoter sequence, an efficient ribosome binding site is 
also useful for the expression of foreign genes in prokaryotes. In E. coli, the ribosome binding 
site is called the Shine-Dalgarno (SD) sequence and includes an initiation codon (ATG) and a 
sequence 3-9 nucleotides in length located 3-1 1 nucleotides upstream of the initiation codon 
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(Shine et aL (1975) Nature 254:34). The SD sequence is thought to promote binding of 
mRNA to the ribosome by the pairing of bases between the SD sequence and the 3' end of 
E. coli 16S rRNA (Steitz et al. (1979) "Genetic signals and nucleotide sequences in 
messenger RNA." In Biological Regulation and Development: Gene Expression (ed. R.F. 
Goldberger)). To express eukaryotic genes and prokaryotic genes with weak ribosome- 
binding site, it is often necessary to optimize the distance between the SD sequence and the 
ATG of the eukaryotic gene (Sambrook et al. (1989) "Expression of cloned genes in 
Escherichia coli." In Molecular Cloning: A Laboratory Manual). 

A DNA molecule may be expressed intracellularly. A promoter sequence may be 
directly linked with the DNA molecule, in which case the first amino acid at the N-terminus 
will always be a methionine, which is encoded by the ATG start codon. If desired, methionine 
at the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen 
bromide or by either in vivo or in vitro incubation with a bacterial methionine N-terminal 
peptidase (EPO Publ. No. 219 237). 

Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous bacterial protein, or other stable protein, is 
fused to the 5* end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the bacteriophage lambda cell 
gene can be linked at the 5' terminus of a foreign gene and expressed in bacteria. The resulting 
fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the 
bacteriophage protein from the foreign gene (Nagai et aL (1984) Nature 509:810). Fusion 
proteins can also be made with sequences from the lacZ (Jia et aL (1987) Gene 60:197), trpE 
(Allen etal. (1987) J. Biotechnol. 5:93; Makoff et al (1989) J. Gen. Microbiol 135:1 1), and 
Chey (EPO Publ. No. 324 647) genes. The DNA sequence at the junction of the two amino 
acid sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion 
protein. Such a fusion protein is made with the ubiquitin region that preferably retains a site 
for a processing enzyme (e.g. ubiquitin specific processing-protease) to cleave the ubiquitin 
from the foreign protein. Through this method, native foreign protein can be isolated (Miller 
et al. (1989) Bio/Technology 7:698). 

Alternatively, foreign proteins can also be secreted from the cell by creating chimeric 
DNA molecules that encode a fusion protein comprised of a signal peptide sequence fragment 
that provides for secretion of the foreign protein in bacteria (U.S. Patent 4,336,336). The 
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signal sequence fragment usually encodes a signal peptide comprised of hydrophobic amino 
acids which direct the secretion of the protein from the cell. The protein is either secreted into 
the growth media (gram-positive bacteria) or into the periplasmic space, located between the 
inner and outer membrane of the cell (gram-negative bacteria). Preferably there are processing 
sites, which can be cleaved either in vivo or in vitro encoded between the signal peptide 
fragment and the foreign gene. 

DNA encoding suitable signal sequences can be derived from genes for secreted 
bacterial proteins, such as the E. coli outer membrane protein gene (ompA) (Masui et al. 
(1983), in: Experimental Manipulation of Gene Expressions Ghrayeb et aL (1984) EMBO J. 
5:2437) and the E. coli alkaline phosphatase signal sequence (phoA) (Oka et al. (1985) Proc. 
Natl Acad. Set 82:7212). As an additional example, the signal sequence of the alpha-amylase 
gene from various Bacillus strains can be used to secrete heterologous proteins from B. 
subtilis (Palva et aL (1982) Proc. Natl Acad. Scl USA 79:5582; EPO Publ. No. 244 042). 

Usually, transcription termination sequences recognized by bacteria are regulatory 
regions located 3 1 to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Transcription termination sequences 
frequently include DNA sequences of about 50 nucleotides capable of forming stem loop 
structures that aid in terminating transcription. Examples include transcription termination 
sequences derived from genes with strong promoters, such as the trp gene in E. coli as well as 
other biosynthetic genes. 

Usually, the above described components, comprising a promoter, signal sequence (if 
desired), coding sequence of interest, and transcription termination sequence, are put together 
into expression constructs. Expression constructs are often maintained in a replicon, such as 
an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
bacteria. The replicon will have a replication system, thus allowing it to be maintained in a 
prokaryotic host either for expression or for cloning and amplification. In addition, a replicon 
may be either a high or low copy number plasmid. A high copy number plasmid will 
generally have a copy number ranging from about 5 to about 200, and usually about 1 0 to 
about 150. A host containing a high copy number plasmid will preferably contain at least 
about 10, and more preferably at least about 20 plasmids. Either a high or low copy number 
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vector may be selected, depending upon the effect of the vector and the foreign protein on the 
host. 

Alternatively, the expression constructs can be integrated into the bacterial genome 
with an integrating vector. Integrating vectors usually contain at least one sequence 
homologous to the bacterial chromosome that allows the vector to integrate. Integrations 
appear to result from recombinations between homologous DNA in the vector and the 
bacterial chromosome. For example, integrating vectors constructed with DNA from various 
Bacillus strains integrate into the Bacillus chromosome (EPO Publ. No. 127 328). Integrating 
vectors may also be comprised of bacteriophage or transposon sequences. 

Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of bacterial strains that have been transformed. 
Selectable markers can be expressed in the bacterial host and may include genes which render 
bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin 
(neomycin), and tetracycline (Davies et al (1978) Annu. Rev. Microbiol 52:469). Selectable 
markers may also include biosynthetic genes, such as those in the histidine, tryptophan, and 
leucine biosynthetic pathways. 

Alternatively, some of the above described components can be put together in 
transformation vectors. Transformation vectors are usually comprised of a selectable market 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extra-chromosomal replicons or 
integrating vectors, have been developed for transformation into many bacteria. For example, 
expression vectors have been developed for, inter alia, the following bacteria: Bacillus 
subtilis (Palva et al (1982) Proa Natl Acad. ScL USA 79:5582; EPO Publ. Nos. 036 259 and 
063 953; PCT Publ. No. WO 84/04541), Escherichia coli (Shimatake et al (1981) Nature 
292:128; Amann et al (1985) Gene 40:183; Studier et al (1986) J. Mol Biol 189:113; EPO 
Publ. Nos. 036 776, 136 829 and 136 907), Streptococcus cremoris (Powell et al (1988) Appl 
Environ. Microbiol 54:655); Streptococcus lividans (Powell et al (1988) Appl Environ. 
Microbiol 54:655), Streptomyces lividans (U.S. Patent 4,745,056). 

Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, 
and usually include either the transformation of bacteria treated with CaCb or other agents, 
such as divalent cations and DMSO. DNA can also be introduced into bacterial cells by 
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electroporation. Transformation procedures usually vary with the bacterial species to be 
transformed. (See e.g., use of Bacillus: Masson et al. (1989) F EMS Microbiol Lett. 60:273 ; 
Palva et al (1982) Proc. Natl Acad. ScL USA 79:5582; EPO Publ. Nos. 036 259 and 063 953; 
PCT Publ. No. WO 84/04541; use of Campylobacter: Miller et al. (1988) Proc. Natl. Acad 
ScL 85:856; and Wang et al. (1990) J. Bacterid. 1 72:949; use of Escherichia coli: Cohen et 
al (1973) Proc. Natl Acad. ScL 69:2110; Dower et al. (1988) Nucleic Acids Res. 7(5:6127; 
Kushner (1978) "An improved method for transformation of Escherichia coli with ColEl- 
derived plasmids. In Genetic Engineering; Proceedings of the International Symposium on 
Genetic Engineering (eds. H.W. Boyer and S. Nicosia); Mandel et al. (1970) J. MoL Biol. 
53:159; Taketo (1988) Biochim. Biophys. Acta 949:31%; use of Lactobacillus: Chassy et al. 
(1987) FEMS Microbiol Lett. 44:113; use of Pseudomonas: Fiedler et al. (1988) Anal 
Biochem 170:3%; use of Staphylococcus: Augustin et al (1990) FEMS Microbiol. Lett. 
66:203; use of Streptococcus: Barany et al. (1980) J. BacterioL 144:698; Harlander (1987) 
"Transformation of Streptococcus lactis by electroporation, in: Streptococcal Genetics (ed. J. 
Ferretti and R. Curtiss III); Perry et al. (1981) Infect. Immun. 32:1295; Powell et al. (1988) 
Appl. Environ. Microbiol. 54:655; Somkuti et al. (1987) Proc. 4th Evr. Cong. Biotechnology 
7:412. 

v. Yeast Expression 

Yeast expression systems are also known to one of ordinary skill in the art. A yeast 
promoter is any DNA sequence capable of binding yeast RNA polymerase and initiating the 
downstream (3 f ) transcription of a coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region which is usually placed proximal to the 5' 
end of the coding sequence. This transcription initiation region usually includes an RNA 
polymerase binding site (the "TATA Box") and a transcription initiation site. A yeast 
promoter may also have a second domain called an upstream activator sequence (UAS), 
which, if present, is usually distal to the structural gene. The UAS permits regulated 
(inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated 
expression may be either positive or negative, thereby either enhancing or reducing 
transcription. 

Yeast is a fermenting organism with an active metabolic pathway, therefore sequences 
encoding enzymes in the metabolic pathway provide particularly useful promoter sequences. 
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Examples include alcohol dehydrogenase (ADH) (EPO Publ. No. 284 044), enolase, 
glucokinase, glucose-6-phosphate isom erase, glyceraldehyde-3-phosphate-dehydrogenase 
(GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and 
pyruvate kinase (PyK) (EPO Publ. No. 329 203). The yeast PHOS gene, encoding acid 
phosphatase, also provides useful promoter sequences (Myanohara et al (1983) Proc. Natl 
Acad, Set USA 80:1). 

In addition, synthetic promoters which do not occur in nature also function as yeast 
promoters. For example, UAS sequences of one yeast promoter may be joined with the 
transcription activation region of another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked to 
the GAP transcription activation region (U.S. Patent Nos. 4,876,197 and 4,880,734). Other 
examples of hybrid promoters include promoters which consist of the regulatory sequences of 
either the ADH2, GAL4, GAL 10, OR PHOS genes, combined with the transcriptional 
activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 164 556). 
Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin 
that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of 
such promoters include, inter alia, (Cohen et al. (1980) Proc. Natl Acad. Set USA 77:1078; 
Henikoff et al. (1981) Nature 253:835; Hollenberg et al. (1981) Curr. Topics Microbiol 
Immunol 96:119; Hollenberg et al. (1979) "The Expression of Bacterial Antibiotic Resistance 
Genes in the Yeast Saccharomyces cerevisiae," in: Plasmids of Medical Environmental and 
Commercial Importance (eds. K.N. Timmis and A. Puhler); Mercerau-Puigalon et al (1980) 
Gene 77:163; Panthier et al (1980) Curr. Genet. 2:109;). 

A DNA molecule may be expressed intracellular^ in yeast. A promoter sequence may 
be directly linked with the DNA molecule, in which case the first amino acid at the N- 
terminus of the recombinant protein will always be a methionine, which is encoded by the 
ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by 
in vitro incubation with cyanogen bromide. 

Fusion proteins provide an alternative for yeast expression systems, as well as in 
mammalian, plant, baculovirus, and bacterial expression systems. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous yeast protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the yeast or human 
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superoxide dismutase (SOD) gene, can be linked at the 5' terminus of a foreign gene and 
expressed in yeast. The DNA sequence at the junction of the two amino acid sequences may 
or may not encode a cleavable site. See e.g., EPO Publ. No. 196056. Another example is a 
ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that 
preferably retains a site for a processing enzyme (e.g. ubiquitin-specific processing protease) 
to cleave the ubiquitin from the foreign protein. Through this method, therefore, native 
foreign protein can be isolated (e.g., WO88/024066). 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provide for secretion in yeast of the foreign protein. Preferably, there 
are processing sites encoded between the leader fragment and the foreign gene that can be 
cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct the secretion of the protein from 
the cell. 

DNA encoding suitable signal sequences can be derived from genes for secreted yeast 
proteins, such as the yeast invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 
62:096,086) and the A-factor gene (U.S. Patent 4,588,684). Alternatively, leaders of non- 
yeast origin, such as an interferon leader, exist that also provide for secretion in yeast (EPO 
Publ. No. 060 057). 

A preferred class of secretion leaders are those that employ a fragment of the yeast 
alpha-factor gene, which contains both a "pre" signal sequence, and a "pro" region. The types 
of alpha- factor fragments that can be employed include the full-length pre-pro alpha factor 
leader (about 83 amino acid residues) as well as truncated alpha-factor leaders (usually about 
25 to about 50 amino acid residues) (U.S. Patent Nos. 4,546,083 and 4,870,008; EPO Publ. 
No. 324 274). Additional leaders employing an alpha-factor leader fragment that provides for 
secretion include hybrid alpha- factor leaders made with a presequence of a first yeast, but a 
pro-region from a second yeast alphafactor. (See e.g., PCT Publ. No. WO 89/02463.) 

Usually, transcription termination sequences recognized by yeast are regulatory 
regions located 3 ! to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Examples of transcription terminator 
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sequence and other yeast-recognized termination sequences, such as those coding for 
glycolytic enzymes. 

Usually, the above described components, comprising a promoter, leader (if desired), 
coding sequence of interest, and transcription termination sequence, are put together into 
expression constructs. Expression constructs are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
yeast or bacteria. The replicon may have two replication systems, thus allowing it to be 
maintained, for example, in yeast for expression and in a prokaryotic host for cloning and 
amplification. Examples of such yeast-bacteria shuttle vectors include YEp24 (Botstein et al. 
(1979) Gene 5:17-24), pCl/1 (Brake et al. (1984) Proc. Natl. Acad. Sci USA 57:4642-4646), 
and YRpl7 (Stinchcomb et al. (1982) J. Mol. Biol. 158:157). In addition, a replicon may be 
either a high or low copy number plasmid. A high copy number plasmid will generally have a 
copy number ranging from about 5 to about 200, and usually about 10 to about 150. A host 
containing a high copy number plasmid will preferably have at least about 10, and more 
preferably at least about 20. Enter a high or low copy number vector may be selected, 
depending upon the effect of the vector and the foreign protein on the host. See e.g., Brake et 
al., supra. 

Alternatively, the expression constructs can be integrated into the yeast genome with 
an integrating vector. Integrating vectors usually contain at least one sequence homologous to 
a yeast chromosome that allows the vector to integrate, and preferably contain two 
homologous sequences flanking the expression construct. Integrations appear to result from 
recombinations between homologous DNA in the vector and the yeast chromosome (Orr- 
Weaver et al. (1983) Methods in Enzymol. 707:228-245). An integrating vector may be 
directed to a specific locus in yeast by selecting the appropriate homologous sequence for 
inclusion in the vector. See Orr-Weaver et al., supra. One or more expression construct may 
integrate, possibly affecting levels of recombinant protein produced (Rine et al. (1983) Proc. 
Natl Acad. Sci. USA 50:6750). The chromosomal sequences included in the vector can occur 
either as a single segment in the vector, which results in the integration of the entire vector, or 
two segments homologous to adjacent segments in the chromosome and flanking the 
expression construct in the vector, which can result in the stable integration of only the 
expression construct. 
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Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of yeast strains that have been transformed. 
Selectable markers may include biosynthetic genes that can be expressed in the yeast host, 
such as ADE2, HIS4, LEU2, TRPJ 9 and ALG7, and the G41 8 resistance gene, which confer 
resistance in yeast cells to tunicamycin and G41 8, respectively. In addition, a suitable 
selectable marker may also provide yeast with the ability to grow in the presence of toxic 
compounds, such as metal. For example, the presence of CUP1 allows yeast to grow in the 
presence of copper ions (Butt et al. (1987) Microbiol Rev. 57:351). 

Alternatively, some of the above described components can be put together into 
transformation vectors. Transformation vectors are usually comprised of a selectable marker 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extrachromosomal replicons or 
integrating vectors, have been developed for transformation into many yeasts. For example, 
expression vectors and methods of introducing exogenous DNA into yeast hosts have been 
developed for, inter alia, the following yeasts: Candida albicans (Kurtz, et aL (1986) Mol 
Cell Biol (5:142); Candida maltosa (Kunze, et al (1985) J. Basic Microbiol 25:141); 
Hansenula polymorpha (Gleeson, et al (1986) J. Gen. Microbiol 732:3459; Roggenkamp et 
al (1986) Mol Gen. Genet. 202:302); Kluyveromyces fragilis (Das, et al (1984) J. Bacteriol 
158:1 165); Kluyveromyces lactis (De Louvencourt et al (1983) J. Bacteriol 154:131; Van 
den Berg et al. (1990) Bio/Technology #:135); Pichia guillerimondii (Kunze et al. (1985) J. 
Basic Microbiol 25:141); Pichia pastoris (Cregg, et aL (1985) Mol Cell Biol 5:3376; U.S. 
Patent Nos. 4,837,148 and 4,929,555); Saccharomyces cerevisiae (Hinnen et aL (1978) Proc. 
Natl. Acad. Set USA 75:1929; Ito et al (1983) J. Bacteriol. 753:163); Schizosaccharomyces 
pombe (Beach and Nurse (1981) Nature 300:106); and Yarrowia lipolytica (Davidow, et al 
(1985) Curr. Genet. 70:380471 Gaillardin, et al (1985) Curr. Genet 70:49). 

Methods of introducing exogenous DNA into yeast hosts are well-known in the art, 
and usually include either the transformation of spheroplasts or of intact yeast cells treated 
with alkali cations. Transformation procedures usually vary with the yeast species to be 
transformed. See e.g., [Kurtz et aL (1986) Mol Cell Biol (5:142; Kunze et al (1985) J. Basic 
Microbiol 25:141; Candida]; [Gleeson al. (1986) J. Gen. Microbiol 732:3459; 
Roggenkamp et al (1986) MoL Gen. Genet. 202:302; Hansenula]; [Das et al. (1984) J. 
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Bacteriol 755:1165; De Louvencourt et al. (1983) J. BacterioL 754:1 165; Van den Berg et al. 
(1990) Bio/Technology 5:135; Kluyveromyces]; [Cregg et al, (1985) Mol Cell, Biol. 5:3376; 
Kunze et al (1985)7. Basic Microbiol. 25:141; U.S. Patent Nos. 4,837,148 and 4,929,555; 
Pichia]; [Hinnen et al (1978) Proc. Natl. Acad. ScL USA 75;1929; Ito et al, (1983) J. 
Bacteriol. 755:163 Saccharomyces]; [Beach and Nurse (1981) Nature 300:706; 
Schizosaccharomyces]; [Davidow et al. (1985) Curr, Genet 70:39; Gaillardin et al, (1985) 
Curr. Genet. 70:49; Yarrowia]. 

Definitions 

A composition containing X is "substantially free of Y when at least 85% by weight 
of the total X+Y in the composition is X. Preferably, X comprises at least about 90% by 
weight of the total of X+Y in the composition, more preferably at least about 95% or even 
99% by weight. 

A "conserved" Neisseria amino acid fragment or protein is one that is present in a 
particular Neisserial protein in at least x% of Neisseria. The value of x may be 50% or more, 
e.g., 66%, 75%, 80%, 90%, 95% or even 100% (i.e. the amino acid is found in the protein in 
question in all Neisseria). In order to determine whether an animo acid is "conserved" in a 
particular Neisserial protein, it is necessary to compare that amino acid residue in the 
sequences of the protein in question from a plurality of different Neisseria (a reference 
population). The reference population may include a number of different Neisseria species 
or may include a single species. The reference population may include a number of different 
serogroups of a particular species or a single serogroup. A preferred reference population 
consists of the 5 most common Neisseria strains. 

The term "heterologous" refers to two biological components that are not found 
together in nature. The components may be host cells, genes, or regulatory regions, such as 
promoters. Although the heterologous components are not found together in nature, they can 
function together, as when a promoter heterologous to a gene is operably linked to the gene. 
Another example is where a Neisserial sequence is heterologous to a mouse host cell. 

"Epitope" means antigenic determinant, and may elicit a cellular and/or humoral 
response. 

Conditions for "high stringency" are 65 degrees C in 0.1 xSSC 0.5% SDS solution. 
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An "origin of replication" is a polynucleotide sequence that initiates and regulates 
replication of polynucleotides, such as an expression vector. The origin of replication behaves 
as an autonomous unit of polynucleotide replication within a cell, capable of replication under 
its own control. An origin of replication may be needed for a vector to replicate in a particular 
host cell. With certain origins of replication, an expression vector can be reproduced at a high 
copy number in the presence of the appropriate proteins within the cell. Examples of origins 
are the autonomously replicating sequences, which are effective in yeast; and the viral 
T-antigen, effective in COS-7 cells. 

A "mutant" sequence is defined as a DNA, RNA or amino acid sequence differing 
from but having homology with the native or disclosed sequence. Depending on the 
particular sequence, the degree of homology (sequence identity) between the native or 
disclosed sequence and the mutant sequence is preferably greater than 50% (e.g., 60%, 70%, 
80%, 90%, 95%, 99% or more) which is calculated as described above. As used herein, an 
"allelic variant" of a nucleic acid molecule, or region, for which nucleic acid sequence is 
provided herein is a nucleic acid molecule, or region, that occurs at essentially the same locus 
in the genome of another or second isolate, and that, due to natural variation caused by, for 
example, mutation or recombination, has a similar but not identical nucleic acid sequence. A 
coding region allelic variant typically encodes a protein having similar activity to that of the 
protein encoded by the gene to which it is being compared. An allelic variant can also 
comprise an alteration in the 5* or 3' untranslated regions of the gene, such as in regulatory 
control regions, (see, for example, U.S. Patent 5,753,235). 

Antibodies 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides 
composed of at least one antibody combining site. An "antibody combining site" is the 
three-dimensional binding space with an internal surface shape and charge distribution 
complementary to the features of an epitope of an antigen, which allows a binding of the 
antibody with the antigen. "Antibody" includes, for example, vertebrate antibodies, hybrid 
antibodies, chimeric antibodies, humanized antibodies, altered antibodies, univalent 
antibodies, Fab proteins, and single domain antibodies. 

Antibodies against the proteins of the invention are useful for affinity 
chromatography, immunoassays, and distinguishing/identifying Neisseria menB proteins. 
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Antibodies elicited against the proteins of the present invention bind to antigenic polypeptides 
or proteins or protein fragments that are present and specifically associated with strains of 
Neisseria meningitidis menB. In some instances, these antigens may be associated with 
specific strains, such as those antigens specific for the menB strains. The antibodies of the 
invention may be immobilized to a matrix and utilized in an immunoassay or on an affinity 
chromatography column, to enable the detection and/or separation of polypeptides, proteins or 
protein fragments or cells comprising such polypeptides, proteins or protein fragments. 
Alternatively, such polypeptides, proteins or protein fragments may be immobilized so as to 
detect antibodies bindably specific thereto. 

Antibodies to the proteins of the invention, both polyclonal and monoclonal, may be 
prepared by conventional methods. In general, the protein is first used to immunize a suitable 
animal, preferably a mouse, rat, rabbit or goat. Rabbits and goats are preferred for the 
preparation of polyclonal sera due to the volume of serum obtainable, and the availability of 
labeled anti-rabbit and anti-goat antibodies. Immunization is generally performed by mixing 
or emulsifying the protein in saline, preferably in an adjuvant such as Freund's complete 
adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or 
intramuscularly). A dose of 50-200 ng/injection is typically sufficient. Immunization is 
generally boosted 2-6 weeks later with one or more injections of the protein in saline, 
preferably using Freund's incomplete adjuvant. One may alternatively generate antibodies by 
in vitro immunization using methods known in the art, which for the purposes of this 
invention is considered equivalent to in vivo immunization. Polyclonal antisera is obtained by 
bleeding the immunized animal into a glass or plastic container, incubating the blood at 25°C 
for one hour, followed by incubating at 4°C for 2-18 hours. The serum is recovered by 
centrifugation (e.g., l,000g* for 10 minutes). About 20-50 ml per bleed may be obtained from 
rabbits. 

Monoclonal antibodies are prepared using the standard method of Kohler & Milstein 
{Nature (1975) 256:495-96), or a modification thereof. Typically, a mouse or rat is 
immunized as described above. However, rather than bleeding the animal to extract serum, 
the spleen (and optionally several large lymph nodes) is removed and dissociated into single 
cells. If desired, the spleen cells may be screened (after removal of nonspecifically adherent 
cells) by applying a cell suspension to a plate or well coated with the protein antigen. B-cells 
that express membrane-bound immunoglobulin specific for the antigen bind to the plate, and 
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are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen 
cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, "HAT"). The 
resulting hybridomas are plated by limiting dilution, and are assayed for the production of 
antibodies which bind specifically to the immunizing antigen (and which do not bind to 
unrelated antigens). The selected MAb-secreting hybridomas are then cultured either in vitro 
(e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (as ascites in mice). 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive 
atoms (particularly 32 P and 125 I), electron-dense reagents, enzymes, and ligands having 
specific binding partners. Enzymes are typically detected by their activity. For example, 
horseradish peroxidase is usually detected by its ability to convert 
3,3\5,5'-tetramethylbenzidine (TMB) to a blue pigment, quantifiable with a 
spectrophotometer. "Specific binding partner" refers to a protein capable of binding a ligand 
molecule with high specificity, as for example in the case of an antigen and a monoclonal 
antibody specific therefor. Other specific binding partners include biotin and avidin or 
streptavidin, IgG and protein A, and the numerous receptor-ligand couples known in the art. It 
should be understood that the above description is not meant to categorize the various labels 
into distinct classes, as the same label may serve in several different modes. For example, 125 I 
may serve as a radioactive label or as an electron-dense reagent. HRP may serve as enzyme or 
as antigen for a MAb. Further, one may combine various labels for desired effect. For 
example, MAbs and avidin also require labels in the practice of this invention: thus, one 
might label a MAb with biotin, and detect its presence with avidin labeled with l25 I, or with 
an anti-biotin MAb labeled with HRP. Other permutations and possibilities will be readily 
apparent to those of ordinary skill in the art, and are considered as equivalents within the 
scope of the instant invention. 

Antigens, immunogens, polypeptides, proteins or protein fragments of the present 
invention elicit formation of specific binding partner antibodies. These antigens, 
immunogens, polypeptides, proteins or protein fragments of the present invention comprise 
immunogenic compositions of the present invention. Such immunogenic compositions may 
further comprise or include adjuvants, carriers, or other compositions that promote or enhance 
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or stabilize the antigens, polypeptides, proteins or protein fragments of the present invention. 
Such adjuvants and carriers will be readily apparent to those of ordinary skill in the art. 

Pharmaceutical Compositions 

Pharmaceutical compositions can comprise (include) either polypeptides, antibodies, 
or nucleic acid of the invention. The pharmaceutical compositions will comprise a 
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 
claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 
detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature, when given to a patient that is febrile. The 
precise effective amount for a subject will depend upon the subject's size and health, the 
nature and extent of the condition, and the therapeutics or combination of therapeutics 
selected for administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation can be determined by routine 
experimentation and is within the judgment of the clinician. 

For purposes of the present invention, an effective dose will be from about 0.01 mg/ 
kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to 
which it is administered. 

A pharmaceutical composition can also contain a pharmaceutical^ acceptable carrier. 
The term "pharmaceutical^ acceptable carrier" refers to a carrier for administration of a 
therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 
The term refers to any pharmaceutical carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition, and which may be 
administered without undue toxicity. Suitable carriers may be large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers are 
well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 
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organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough 
discussion of pharmaceutical^ acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., NJ. 1991). 

Pharmaceutical^ acceptable carriers in therapeutic compositions may contain liquids 
such as water, saline, glycerol and ethanol. Additionally, auxiliary substances, such as wetting 
or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles. 
Typically, the therapeutic compositions are prepared as injectables, either as liquid solutions 
or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to 
injection may also be prepared. Liposomes are included within the definition of a 
pharmaceutical^ acceptable carrier. 

Delivery Methods 

Once formulated, the compositions of the invention can be administered directly to the 
subject. The subjects to be treated can be animals; in particular, human subjects can be 
treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneal^, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a lesion. Other 
modes of administration include oral and pulmonary administration, suppositories, and 
transdermal and transcutaneous applications, needles, and gene guns or hyposprays. Dosage 
treatment may be a single dose schedule or a multiple dose schedule. 

Vaccines 

Vaccines according to the invention may either be prophylactic (i.e., to prevent 
infection) or therapeutic (i.e., to treat disease after infection). 

Such vaccines comprise immunizing antigen(s) or immunogen(s), immunogenic 
polypeptide, protein(s) or protein fragments, or nucleic acids (e.g., ribonucleic acid or 
deoxyribonucleic acid), usually in combination with "pharmaceutically acceptable carriers," 
which include any carrier that does not itself induce the production of antibodies harmful to 
the individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic 
acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or 
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liposomes), and inactive virus particles. Such carriers are well known to those of ordinary 
skill in the art. Additionally, these carriers may function as immunostimulating agents 
("adjuvants"). Furthermore, the immunogen or antigen may be conjugated to a bacterial 
toxoid, such as a toxoid from diphtheria, tetanus, cholera, H. pylori, etc. pathogens. 

Preferred adjuvants to enhance effectiveness of the composition include, but are not 
limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum phosphate, 
aluminum sulfate, etc; (2) oil-in-water emulsion formulations (with or without other specific 
immunostimulating agents such as muramyl peptides (see below) or bacterial cell wall 
components), such as for example (a) MF59 (PCT Publ. No. WO 90/14837), containing 5% 
Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts of 
MTP-PE (see below), although not required) formulated into submicron particles using a 
microfluidizer such as Model HOY micro fluidizer (Microfluidics, Newton, MA), (b) SAF, 
containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP 
(see below) either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion, and (c) Ribi™ adjuvant system (RAS), (Ribi Immunochem, Hamilton, 
MT) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell wall 
components from the group consisting of monophosphorylipid A (MPL), trehalose 
dimycolate (TDM), and ceil wall skeleton (CWS), preferably MPL + CWS (Detox™); 

(3) saponin adjuvants, such as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be 
used or particles generated therefrom such as ISCOMs (immunostimulating complexes); 

(4) Complete Freund's Adjuvant (CFA) and Incomplete Freund's Adjuvant (IF A); 

(5) cytokines, such as interleukins (e.g., IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), 
interferons (e.g„ gamma interferon), macrophage colony stimulating factor (M-CSF), tumor 
necrosis factor (TNF), etc; (6) detoxified mutants of a bacterial ADP-ribosylating toxin such 
as a cholera toxin (CT), a pertussis toxin (PT), or an E. coli heat-labile toxin (LT), particularly 
LT-K63, LT-R72, CT-S109, PT-K9/G129; see, e.g., WO 93/13302 and WO 92/19265; and 
(7) other substances that act as immunostimulating agents to enhance the effectiveness of the 
composition. Alum and MF59 are preferred. 

As mentioned above, muramyl peptides include, but are not limited to, N-acetyl- 
muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D- 
isoglutamine (nor-MDP), N-acetylm\iramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(r-2'- 
dipalmitoyl-5w-glycero-3-huydroxyphosphoryloxy)-ethylamine (MTP-PE), eta 
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The vaccine compositions comprising immunogenic compositions (e.g., which may 
include the antigen, pharmaceutically acceptable carrier, and adjuvant) typically will contain 
diluents, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such 
as wetting or emulsifying agents, pH buffering substances, and the like, may be present in 
such vehicles. Alternatively, vaccine compositions comprising immunogenic compositions 
may comprise an antigen, polypeptide, protein, protein fragment or nucleic acid in a 
pharmaceutically acceptable carrier. 

More specifically, vaccines comprising immunogenic compositions comprise an 
immunologically effective amount of the immunogenic polypeptides, as well as any other of 
the above-mentioned components, as needed. By "immunologically effective amount", it is 
meant that the administration of that amount to an individual, either in a single dose or as part 
of a series, is effective for treatment or prevention. This amount varies depending upon the 
health and physical condition of the individual to be treated, the taxonomic group of 
individual to be treated (e.g., nonhuman primate, primate, etc.) y the capacity of the 
individual's immune system to synthesize antibodies, the degree of protection desired, the 
formulation of the vaccine, the treating doctor's assessment of the medical situation, and other 
relevant factors. It is expected that the amount will fall in a relatively broad range that can be 
determined through routine trials. 

Typically, the vaccine compositions or immunogenic compositions are prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or 
suspension in, liquid vehicles prior to injection may also be prepared. The preparation also 
may be emulsified or encapsulated in liposomes for enhanced adjuvant effect, as discussed 
above under pharmaceutically acceptable carriers. 

The immunogenic compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Additional formulations suitable for other 
modes of administration include oral and pulmonary formulations, suppositories, and 
transdermal and transcutaneous applications. Dosage treatment may be a single dose schedule 
or a multiple dose schedule. The vaccine may be administered in conjunction with other 
immunoregulatory agents. 

As an alternative to protein-based vaccines, DNA vaccination may be employed (e.g., 
Robinson & Torres (1997) Seminars in Immunology 9:271-283; Donnelly et al (1997) Annu 
Rev Immunol 15:617-648). 
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Gene Delivery Vehicles 

Gene therapy vehicles for delivery of constructs, including a coding sequence of a 
therapeutic of the invention, to be delivered to the mammal for expression in the mammal, can 
be administered either locally or systemically. These constructs can utilize viral or non- viral 
vector approaches in in vivo or ex vivo modality. Expression of such coding sequence can be 
induced using endogenous mammalian or heterologous promoters. Expression of the coding 
sequence in vivo can be either constitutive or regulated. 

The invention includes gene delivery vehicles capable of expressing the contemplated 
nucleic acid sequences. The gene delivery vehicle is preferably a viral vector and, more 
preferably, a retroviral, adenoviral, adeno-associated viral (AAV), herpes viral, or alphavirus 
vector. The viral vector can also be an astrovirus, coronavirus, orthomyxovirus, papovavirus, 
paramyxovirus, parvovirus, picornavirus, poxvirus, or togavirus viral vector. See generally, 
Jolly (1994) Cancer Gene Therapy 1:51-64; Kimura (1994) Human Gene Therapy 5:845-852; 
Connelly (1995) Human Gene Therapy 6:185-193; and Kaplitt (1994) Nature Genetics 
6:148-153. 

Retroviral vectors are well known in the art, including B, C and D type retroviruses, 
xenotropic retroviruses (for example, NZB-X1, NZB-X2 and NZB9-1 (see O'Neill (1985) J. 
Virol 53:160) polytropic retroviruses e.g., MCF and MCF-MLV (see Kelly (1983) J. Virol 
45:291), spumaviruses and lentiviruses. See RNA Tumor Viruses, Second Edition, Cold 
Spring Harbor Laboratory, 1985. 

Portions of the retroviral gene therapy vector may be derived from different 
retroviruses. For example, retrovector LTRs may be derived from a Murine Sarcoma Virus, a 
tRNA binding site from a Rous Sarcoma Virus, a packaging signal from a Murine Leukemia 
Virus, and an origin of second strand synthesis from an Avian Leukosis Virus. 

These recombinant retroviral vectors may be used to generate transduction competent 
retroviral vector particles by introducing them into appropriate packaging cell lines (see US 
patent 5,591,624). Retrovirus vectors can be constructed for site-specific integration into host 
cell DNA by incorporation of a chimeric integrase enzyme into the retroviral particle (see 
W096/37626). It is preferable that the recombinant viral vector is a replication defective 
recombinant virus. 
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Packaging cell lines suitable for use with the above-described retrovirus vectors are 
well known in the art, are readily prepared (see WO95/30763 and WO92/05266), and can be 
used to create producer cell lines (also termed vector cell lines or "VCLs") for the production 
of recombinant vector particles. Preferably, the packaging cell lines are made from human 
parent cells (e.g., HT1080 cells) or mink parent cell lines, which eliminates inactivation in 
human serum. 

Preferred retroviruses for the construction of retroviral gene therapy vectors include 
Avian Leukosis Virus, Bovine Leukemia, Virus, Murine Leukemia Virus, Mink-Cell 
Focus-Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis Virus and Rous 
Sarcoma Virus. Particularly preferred Murine Leukemia Viruses include 407 OA and 1504A 
(Hartley and Rowe (1976) J Virol 19:19-25), Abelson (ATCC No. VR-999), Friend (ATCC 
No. VR-245), Graffi, Gross (ATCC Nol VR-590), Kirsten, Harvey Sarcoma Virus and 
Rauscher (ATCC No. VR-998) and Moloney Murine Leukemia Virus (ATCC No. VR-190). 
Such retroviruses may be obtained from depositories or collections such as the American 
Type Culture Collection ("ATCC") in Rockville, Maryland or isolated from known sources 
using commonly available techniques. 

Exemplary known retroviral gene therapy vectors employable in this invention include 
those described in patent applications GB2200651, EP0415731, EP0345242, EP0334301, 
WO89/02468; WO89/05349, WO89/09271, WO90/02806, WO90/07936, WO94/03622, 
W093/25698, W093/25234, W093/1 1230, WO93/10218, WO91/02805, WO91/02825, 
WO95/07994, US 5,219,740, US 4,405,712, US 4,861,719, US 4,980,289, US 4,777,127, US 
5,591,624. See also Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 
53:962-967; Ram (1993) Cancer Res 53 (1993) 83-88; Takamiya (1992) JNeurosci Res 
33:493-503; Baba (1993) JNeurosurg 79:729-735; Mann (1983) Cell 33:153; Cane (1984) 
Proc Natl Acad Sci 81:6349; and Miller (1990) Human Gene Therapy 1. 

Human adenoviral gene therapy vectors are also known in the art and employable in 
this invention. See, for example, Berkner (1988) Biotechniques 6:616 and Rosenfeld (1991) 
Science 252:431, and WO93/07283, WO93/06223, and WO93/07282. Exemplary known 
adenoviral gene therapy vectors employable in this invention include those described in the 
above referenced documents and in W094/12649, WO93/03769, W093/19191, 
W094/28938, W095/11984, WO95/00655, WO95/27071, W095/29993, W095/34671, 
WO96/05320, WO94/08026, WO94/11506, WO93/06223, WQ94/24299, WO95/14102, 
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W095/24297, WO95/02697, W094/28152, W094/24299, WO95/09241, WO95/25807, 
WO95/05835, W094/18922 and WO95/09654. Alternatively, administration of DNA linked 
to killed adenovirus as described in Curiel (1992) Hum. Gene Ther. 3:147-154 may be 
employed. The gene delivery vehicles of the invention also include adenovirus associated 
virus (AAV) vectors. Leading and preferred examples of such vectors for use in this invention 
are the AAV-2 based vectors disclosed in Srivastava, WO93/09239. Most preferred AAV 
vectors comprise the two AAV inverted terminal repeats in which the native D-sequences are 
modified by substitution of nucleotides, such that at least 5 native nucleotides and up to 18 
native nucleotides, preferably at least 10 native nucleotides up to 18 native nucleotides, most 
preferably 10 native nucleotides are retained and the remaining nucleotides of the D-sequence 
are deleted or replaced with non-native nucleotides. The native D-sequences of the AAV 
inverted terminal repeats are sequences of 20 consecutive nucleotides in each AAV inverted 
terminal repeat (i.e., there is one sequence at each end) which are not involved in HP 
formation. The non-native replacement nucleotide may be any nucleotide other than the 
nucleotide found in the native D-sequence in the same position. Other employable exemplary 
AAV vectors are pWP-19, pWN-1, both of which are disclosed in Nahreini (1993) Gene 
124:257-262. Another example of such an AAV vector is psub201 (see Samulski (1987) J. 
Virol 61:3096). Another exemplary AAV vector is the Double-D ITR vector. Construction of 
the Double-D ITR vector is disclosed in US Patent 5,478,745. Still other vectors are those 
disclosed in Carter US Patent 4,797,368 and Muzyczka US Patent 5,139,941, Chartejee US 
Patent 5,474,935, and Kotin W094/288157. Yet a further example of an AAV vector 
employable in this invention is SSV9AFABTKneo, which contains the AFP enhancer and 
albumin promoter and directs expression predominantly in the liver. Its structure and 
construction are disclosed in Su (1996) Human Gene Therapy 7:463-470. Additional AAV 
gene therapy vectors are described in US 5,354,678, US 5,173,414, US 5,139,941, and US 
5,252,479. 

The gene therapy vectors comprising sequences of the invention also include herpes 
vectors. Leading and preferred examples are herpes simplex virus vectors containing a 
sequence encoding a thymidine kinase polypeptide such as those disclosed in US 5,288,641 
and EP0176170 (Roizman). Additional exemplary herpes simplex virus vectors include 
HFEM/ICP6-LacZ disclosed in WO95/04139 (Wistar Institute), pHSVlac described in Geller 
(1988) Science 241 :1667~1669 and in WO90/09441 and WO92/07945, HSV Us3::pgC-lacZ 
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described in Fink (1992) Human Gene Therapy 3:1 1-19 and HSV 7134, 2 RH 105 and GAL4 
described in EP 0453242 (Breakefield), and those deposited with the ATCC as accession 
numbers ATCC VR-977 and ATCC VR-260. 

Also contemplated are alpha virus gene therapy vectors that can be employed in this 
invention. Preferred alpha virus vectors are Sindbis viruses vectors. Togaviruses, Semliki 
Forest virus (ATCC VR-67; ATCC VR-1247), Middleberg virus (ATCC VR-370), Ross 
River virus (ATCC VR-373; ATCC VR-1 246), Venezuelan equine encephalitis virus (ATCC 
VR923; ATCC VR-1 250; ATCC VR-1249; ATCC VR-532), and those described in US 
patents 5,091,309, 5,217,879, and WO92/10578. More particularly, those alpha virus vectors 
described in U.S. Serial No. 08/405,627, filed March 15, 1995 5 W094/21792, WO92/10578, 
WO95/07994, US 5,091,309 and US 5,217,879 are employable. Such alpha viruses may be 
obtained from depositories or collections such as the ATCC in Rockville, Maryland or 
isolated from known sources using commonly available techniques. Preferably, alphavirus 
vectors with reduced cytotoxicity are used (see USSN 08/679640). 

DNA vector systems such as eukarytic layered expression systems are also useful for 
expressing the nucleic acids of the invention. SeeWO95/07994 for a detailed description of 
eukaryotic layered expression systems. Preferably, the eukaryotic layered expression systems 
of the invention are derived from alphavirus vectors and most preferably from Sindbis viral 
vectors. 

Other viral vectors suitable for use in the present invention include those derived from 
poliovirus, for example ATCC VR-58 and those described in Evans, Nature 339 (1989) 385 
and Sabin (1973) J. Biol Standardization 1:115; rhinovirus, for example ATCC VR-1 110 and 
those described in Arnold (1990) J Cell Biochem L401 ; pox viruses such as canary pox virus 
or vaccinia virus, for example ATCC VR-1 1 1 and ATCC VR-2010 and those described in 
Fisher-Hoch (1 989) Proc Natl Acad Sci 86:3 1 7; Flexner (1 989) Ann NY Acad Sci 569:86, 
Flexner (1990) Vaccine 8:17; in US 4,603,1 12 and US 4,769,330 and WO89/01973; SV40 
virus, for example ATCC VR-305 and those described in Mulligan (1979) Nature 277:108 
and Madzak (1992) J Gen Virol 73:1533; influenza virus, for example ATCC VR-797 and 
recombinant influenza viruses made employing reverse genetics techniques as described in 
US 5,166,057 and in Enami (1990) Proc Natl Acad Sci 87:3802-3805; Enami & Palese (1991) 
J Virol 65:271 1-2713 and Luytjes (1989) Cell 59:1 10, (see also McMichael (1983) NEJMed 
309:13, and Yap (1978) Nature 273:238 and Nature (1979) 277:108); human 
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immunodeficiency virus as described in EP-0386882 and in Buchschacher (1992) J. Virol 
66:2731; measles virus, for example ATCC VR-67 and VR-1247 and those described in EP- 
0440219; Aura virus, for example ATCC VR-368; Bebaru virus, for example ATCC VR-600 
and ATCC VR-1240; Cabassou virus, for example ATCC VR-922; Chikungunya virus, for 
example ATCC VR-64 and ATCC VR-1241 ; Fort Morgan Virus, for example ATCC 
VR-924; Getah virus, for example ATCC VR-369 and ATCC VR-1243; Kyzylagach virus, 
for example ATCC VR-927; Mayaro virus, for example ATCC VR-66; Mucambo virus, for 
example ATCC VR-580 and ATCC VR-1244; Ndumu virus, for example ATCC VR-371; 
Pixuna virus, for example ATCC VR-372 and ATCC VR-1245; Tonate virus, for example 
ATCC VR-925; Triniti virus, for example ATCC VR-469; Una virus, for example ATCC 
VR-374; Whataroa virus, for example ATCC VR-926; Y-62-33 virus, for example ATCC 
VR-375; O'Nyong virus, Eastern encephalitis virus, for example ATCC VR-65 and ATCC 
VR-1242; Western encephalitis virus, for example ATCC VR-70, ATCC VR-1251, ATCC 
VR-622 and ATCC VR-1252; and coronavirus, for example ATCC VR-740 and those 
described in Hamre (1 966) Proc Soc Exp Biol Med 121 : 190. 

Delivery of the compositions of this invention into cells is not limited to the above 
mentioned viral vectors. Other delivery methods and media may be employed such as, for 
example, nucleic acid expression vectors, polycationic condensed DNA linked or unlinked to 
killed adenovirus alone, for example see US Serial No. 08/366,787, filed December 30, 1994 
and Curiel (1992) Hum Gene Ther 3:147-154 ligand linked DNA, for example see Wu (1989) 
J Biol Chem 264:16985-16987, eucaryotic cell delivery vehicles cells, for example see US 
Serial No.08/240,030, filed May 9, 1994, and US Serial No. 08/404,796, deposition of 
photopolymerized hydrogel materials, hand-held gene transfer particle gun, as described in 
US Patent 5,149,655, ionizing radiation as described in US5,206,152 and in W092/1 1033, 
nucleic charge neutralization or fusion with cell membranes. Additional approaches are 
described in Philip (1994) Mol Cell Biol 14:241 1-2418 and in Woffendin (1994) Proc Natl 
AcadSci 91:1581-1585. 

Particle mediated gene transfer may be employed, for example see US Serial No. 
60/023,867. Briefly, the sequence can be inserted into conventional vectors that contain 
conventional control sequences for high level expression, and then incubated with synthetic 
gene transfer molecules such as polymeric DNA-binding cations like polylysine, protamine, 
and albumin, linked to cell targeting ligands such as asialoorosomucoid, as described in Wu & 
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Wu (1987) J. Biol Chem. 262:4429-4432, insulin as described in Hucked (1990) Biochem 
Pharmacol 40:253-263, galactose as described in Plank (1992) Bioconjugate Chem 
3:533-539, lactose or transferrin. 

Naked DNA may also be employed to transform a host cell. Exemplary naked DNA 
introduction methods are described in WO 90/1 1092 and US 5,580,859. Uptake efficiency 
may be improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method may be improved 
further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. 

Liposomes that can act as gene delivery vehicles are described in U.S. 5,422,120, 
W095/13796, W094/23697, W091/14445 and EP-524,968. As described in USSN. 
60/023,867, on non-viral delivery, the nucleic acid sequences encoding a polypeptide can be 
inserted into conventional vectors that contain conventional control sequences for high level 
expression, and then be incubated with synthetic gene transfer molecules such as polymeric 
DNA-binding cations like polylysine, protamine, and albumin, linked to cell targeting ligands 
such as asialoorosomucoid, insulin, galactose, lactose, or transferrin. Other delivery systems 
include the use of liposomes to encapsulate DNA comprising the gene under the control of a 
variety of tissue-specific or ubiquitously-active promoters. Further non-viral delivery suitable 
for use includes mechanical delivery systems such as the approach described in Woffendin et 
al (1994) Proc. Natl Acad, Scl USA 91 (24): 1 1581-1 1585. Moreover, the coding sequence 
and the product of expression of such can be delivered through deposition of 
photopolymerized hydrogel materials. Other conventional methods for gene delivery that can 
be used for delivery of the coding sequence include, for example, use of hand-held gene 
transfer particle gun, as described in U.S. 5,149,655; use of ionizing radiation for activating 
transferred gene, as described in U.S. 5,206,152 and W092/1 1033. 

Exemplary liposome and polycationic gene delivery vehicles are those described in 
US 5,422,120 and 4,762,915; in WO 95/13796; W094/23697; and W091/14445; in EP- 
0524968; and in Stryer, Biochemistry, pages 236-240 (1975) W.H. Freeman, San Francisco; 
Szoka (1980) Biochem Biophys Acta 600:1; Bayer (1979) Biochem Biophys Acta 550:464; 
Rivnay (1987) Meth Enzymol 149:1 19; Wang (1987) Proc Natl Acad Sci 84:7851; Plant 
(1989) Anal Biochem 176:420. 
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A polynucleotide composition can comprises therapeutically effective amount of a 
gene therapy vehicle, as the term is defined above. For purposes of the present invention, an 
effective dose will be from about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg 
of the DNA constructs in the individual to which it is administered. 

Delivery Methods 

Once formulated, the polynucleotide compositions of the invention can be 
administered (1) directly to the subject; (2) delivered ex vivo, to cells derived from the 
subject; or (3) in vitro for expression of recombinant proteins. The subjects to be treated can 
be mammals or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperltoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or lesion. 
Other modes of administration include oral and pulmonary administration, suppositories, and 
transdermal applications, needles, and gene guns or hyposprays. Dosage treatment may be a 
single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject 
are known in the art and described in eg. W093/ 14778. Examples of cells useful in ex vivo 
applications include, for example, stem cells, particularly hematopoetic, lymph cells, 
macrophages, dendritic cells, or tumor cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by the following procedures, for example, dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection 
of the DNA into nuclei, all well known in the art. 

Polynucleotide and polypeptide pharmaceutical compositions 

In addition to the pharmaceutically acceptable carriers and salts described above, the 
following additional agents can be used with polynucleotide and/or polypeptide compositions. 
A.Polypeptides 

One example are polypeptides which include, without limitation: asioloorosomucoid 
(ASOR); transferrin; asialoglycoproteins; antibodies; antibody fragments; ferritin; 
interleukins; interferons, granulocyte, macrophage colony stimulating factor (GM-CSF), 
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granulocyte colony stimulating factor (G-CSF), macrophage colony stimulating factor 
(M-CSF), stem cell factor and erythropoietin. Viral antigens, such as envelope proteins, can 
also be used. Also, proteins from other invasive organisms, such as the 17 amino acid peptide 
from the circumsporozoite protein of Plasmodium falciparum known as RII. 

B. Hormones, Vitamins, Etc. 

Other groups that can be included are, for example: hormones, steroids, androgens, 
estrogens, thyroid hormone, or vitamins, folic acid. 

C. Polyalkylenes, Polysaccharides, etc. 

Also, polyalkylene glycol can be included with the desired polynucleotides or 
polypeptides. In a preferred embodiment, the polyalkylene glycol is polyethlylene glycol. In 
addition, mono-, di-, or polysaccarides can be included. In a preferred embodiment of this 
aspect, the polysaccharide is dextran or DEAE-dextran. Also, chitosan and 
poly(lactide-co-glycolide) 

D. Lipids, and Liposomes 

The desired polynucleotide or polypeptide can also be encapsulated in lipids or 
packaged in liposomes prior to delivery to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are able to 
stably bind or entrap and retain nucleic acid. The ratio of condensed polynucleotide or 
polypeptide to lipid preparation can vary but will generally be around 1 : 1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers for 
delivery of nucleic acids, see, Hug and Sleight (1991) Biochim. Biophys, Acta. 1097:1-17; 
Straubinger (1983) Meth. EnzymoL 101:512-527. 

Liposomal preparations for use in the present invention include cationic (positively 
charged), anionic (negatively charged) and neutral preparations. Cationic liposomes have 
been shown to mediate intracellular delivery of plasmid DNA (Feigner (1987) Proc. Natl. 
Acad. ScL USA 84:7413-7416); mRNA (Malone (1989) Proc. Natl. Acad. ScL USA 
86:6077-6081); and purified transcription factors (Debs (1990) J. Biol Chem. 
265:10189-10192), in functional form. 

Cationic liposomes are readily available. For example, 
N[l-2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 
under the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also, Feigner 
supra). Other commercially available liposomes include transfectace (DDAB/DOPE) and 
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DOTAP/DOPE (Boerhinger). Other cationic liposomes can be prepared from readily available 
materials using techniques well known in the art. See, eg. Szoka (1978) Proc. Natl Acad. Sci. 
USA 75:4194-4198; WO90/1 1092 for a description of the synthesis of DOTAP 
(1 ,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, AL), or can be easily prepared using readily available materials. 
Such materials include phosphatidyl choline, cholesterol, phosphatidyl ethanolamine, 
dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), 
dioleoylphoshatidyl ethanolamine (DOPE), among others. These materials can also be mixed 
with the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

The liposomes can comprise multilammelar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs). The various liposome-nucleic acid 
complexes are prepared using methods known in the art. See eg. Straubinger (1983) Meth. 
Immunol. 101:512-527; Szoka (1978) Proc. Natl Acad. Sci. USA 75:4194-4198; 
Papahadjopoulos (1975) Biochim. Biophys. Acta 394:483; Wilson (1979) Cell 17:77); Deamer 
& Bangham (1976) Biochim. Biophys. Acta 443:629; Ostro (1977) Biochem. Biophys. Res. 
Commun. 76:836; Fraley (1979) Proc. Natl. Acad. Set USA 76:3348); Enoch & Strittmatter 
(1979) Proc. Natl. Acad. Sci. USA 76:145; Fraley (1980) J. Biol. Chem. (1980) 255:10431; 
Szoka & Papahadjopoulos (1978) Proc. Natl Acad. Sci USA 75:145; and Schaefer-Ridder 
(1982) Science 215:166. 
EXipoproteins 

In addition, lipoproteins can be included with the polynucleotide or polypeptide to be 
delivered. Examples of lipoproteins to be utilized include: chylomicrons, HDL, IDL, LDL, 
and VLDL. Mutants, fragments, or fusions of these proteins can also be used. Also, 
modifications of naturally occurring lipoproteins can be used, such as acetylated LDL. These 
lipoproteins can target the delivery of polynucleotides to cells expressing lipoprotein 
receptors. Preferably, if lipoproteins are including with the polynucleotide to be delivered, no 
other targeting ligand is included in the composition. 

Naturally occurring lipoproteins comprise a lipid and a protein portion. The protein 
portion are known as apoproteins. At the present, apoproteins A, B, C, D, and E have been 
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isolated and identified. At least two of these contain several proteins, designated by Roman 
numerals, Al, All, A1V; CI, CII, CIIL 

A lipoprotein can comprise more than one apoprotein. For example, naturally 
occurring chylomicrons comprises of A, B, C, and E, over time these lipoproteins lose A and 
acquire C and E apoproteins. VLDL comprises A, B, C, and E apoproteins, LDL comprises 
apoprotein B; and HDL comprises apoproteins A, C, and E. 

The amino acid of these apoproteins are known and are described in, for example, 
Breslow (1985) Annu Rev. Biochem 54:699; Law (1986) Adv. Exp Med. Biol 151 :162; Chen 
(1986) J Biol Chem 261:12918; Kane (1980) Proc Natl Acad Sci USA 77:2465; and Utermann 
(1984) Hum Genet 65:232. 

Lipoproteins contain a variety of lipids including, triglycerides, cholesterol (free and 
esters), and phopholipids. The composition of the lipids varies in naturally occurring 
lipoproteins. For example, chylomicrons comprise mainly triglycerides. A more detailed 
description of the lipid content of naturally occurring lipoproteins can be found, for example, 
in Meth. Enzymol. 128 (1986). The composition of the lipids are chosen to aid in 
conformation of the apoprotein for receptor binding activity. The composition of lipids can 
also be chosen to facilitate hydrophobic interaction and association with the polynucleotide 
binding molecule. 

Naturally occurring lipoproteins can be isolated from serum by ultracentrifugation, for 
instance. Such methods are described in Meth. Enzymol. (supra); Pitas (1980) J. Biochem. 
255:5454-5460 and Mahey (1979) J Clin. Invest 64:743-750. 

Lipoproteins can also be produced by in vitro or recombinant methods by expression 
of the apoprotein genes in a desired host cell. See, for example, Atkinson (1986) Annu Rev 
Biophys Chem 15:403 and Radding (1958) Biochim Biophys Acta 30: 443. 

Lipoproteins can also be purchased from commercial suppliers, such as Biomedical 
Techniologies, Inc., Stoughton, Massachusetts, USA. 

Further description of lipoproteins can be found in Zuckermann et aL, PCT. Appln. 
No. US97/14465. 
F.Polycationic Agents 

Polycationic agents can be included, with or without lipoprotein, in a composition 
with the desired polynucleotide or polypeptide to be delivered. 
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Polycationic agents, typically, exhibit a net positive charge at physiological relevant 
pH and are capable of neutralizing the electrical charge of nucleic acids to facilitate delivery 
to a desired location. These agents have both in vitro, ex vivo, and in vivo applications. 
Polycationic agents can be used to deliver nucleic acids to a living subject either 
intramuscularly, subcutaneously, etc. 

The following are examples of useful polypeptides as polycationic agents: polylysine, 
polyarginine, polyornithine, and protamine. Other examples include histones, protamines, 
human serum albumin, DNA binding proteins, non-histone chromosomal proteins, coat 
proteins from DNA viruses, such as (XI 74, transcriptional factors also contain domains that 
bind DNA and therefore may be useful as nucleic aid condensing agents. Briefly, 
transcriptional factors such as C/CEBP, c-jun, c-fos, AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1, 
Oct-1, Oct-2, CREP, and TFIID contain basic domains that bind DNA sequences. 

Organic polycationic agents include: spermine, spermidine, and purtrescine. 

The dimensions and of the physical properties of a polycationic agent can be 
extrapolated from the list above, to construct other polypeptide polycationic agents or to 
produce synthetic polycationic agents. 

Synthetic Polycationic Agents 

Synthetic polycationic agents which are useful include, for example, DEAE-dextran, 
polybrene. LipofectinD, and lipofectAMINED are monomers that form polycationic 
complexes when combined with polynucleotides or polypeptides. 

Immunodiagnostic Assays 

Neisserial antigens of the invention can be used in immunoassays to detect antibody 
levels (or, conversely, anti-Neisserial antibodies can be used to detect antigen levels). 
Immunoassays based on well defined, recombinant antigens can be developed to replace 
invasive diagnostics methods. Antibodies to Neisserial proteins within biological samples, 
including for example, blood or serum samples, can be detected. Design of the immunoassays 
is subject to a great deal of variation, and a variety of these are known in the art. Protocols for 
the immunoassay may be based, for example, upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, use solid supports, or may be by 
immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the 
labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. 
Assays which amplify the signals from the probe are also known; examples of which are 
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assays which utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such 
as ELISA assays. 

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are 
constructed by packaging the appropriate materials, including the compositions of the 
invention, in suitable containers, along with the remaining reagents and materials (for 
example, suitable buffers, salt solutions, etc.) required for the conduct of the assay, as well as 
suitable set of assay instructions. 

Nucleic Acid Hybridisation 

"Hybridization" refers to the association of two nucleic acid sequences to one another 
by hydrogen bonding. Typically, one sequence will be fixed to a solid support and the other 
will be free in solution. Then, the two sequences will be placed in contact with one another 
under conditions that favor hydrogen bonding. Factors that affect this bonding include: the 
type and volume of solvent; reaction temperature; time of hybridization; agitation; agents to 
block the non-specific attachment of the liquid phase sequence to the solid support 
(Denhardt ! s reagent or BLOTTO); concentration of the sequences; use of compounds to 
increase the rate of association of sequences (dextran sulfate or polyethylene glycol); and the 
stringency of the washing conditions following hybridization. See Sambrook et ah [supra] 
Volume 2, chapter 9, pages 9.47 to 9.57. 

"Stringency" refers to conditions in a hybridization reaction that favor association of 
very similar sequences over sequences that differ. For example, the combination of 
temperature and salt concentration should be chosen that is approximately 120 to 200 DC 
below the calculated Tm of the hybrid under study. The temperature and salt conditions can 
often be determined empirically in preliminary experiments in which samples of genomic 
DNA immobilized on filters are hybridized to the sequence of interest and then washed under 
conditions of different stringencies. See Sambrook et ah at page 9.50. 

Variables to consider when performing, for example, a Southern blot are (1) the 
complexity of the DNA being blotted and (2) the homology between the probe and the 
sequences being detected. The total amount of the fragment(s) to be studied can vary a 
magnitude of 10, from 0. 1 to 1 ixg for a plasmid or phage digest to 10' 9 to 10" 8 g for a single 
copy gene in a highly complex eukaryotic genome. For lower complexity polynucleotides, 
substantially shorter blotting, hybridization, and exposure times, a smaller amount of starting 
polynucleotides, and lower specific activity of probes can be used. For example, a single-copy 
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yeast gene can be detected with an exposure time of only 1 hour starting with 1 jig of yeast 
DNA, blotting for two hours, and hybridizing for 4-8 hours with a probe of 1 0 8 cpm/fig. For a 
single-copy mammalian gene a conservative approach would start with 10 jig of DNA, blot 
overnight, and hybridize overnight in the presence of 10% dextran sulfate using a probe of 
greater than 10 8 cpm/jig, resulting in an exposure time of -24 hours. 

Several factors can affect the melting temperature (Tm) of a DNA-DNA hybrid 
between the probe and the fragment of interest, and consequently, the appropriate conditions 
for hybridization and washing. In many cases the probe is not 100% homologous to the 
fragment. Other commonly encountered variables include the length and total G+C content of 
the hybridizing sequences and the ionic strength and formamide content of the hybridization 
buffer. The effects of all of these factors can be approximated by a single equation: 

Tm= 81 + 16.6(log 10 Ci) + 0.4[%(G + C)]-0.6(%formamide) - 600/«-1.5(%mismatch). 

where Ci is the salt concentration (monovalent ions) and n is the length of the hybrid 
in base pairs (slightly modified from Meinkoth & Wahl (1984) Anal, Biochem. 138: 267-284). 

In designing a hybridization experiment, some factors affecting nucleic acid 
hybridization can be conveniently altered. The temperature of the hybridization and washes 
and the salt concentration during the washes are the simplest to adjust. As the temperature of 
the hybridization increases (ie. stringency), it becomes less likely for hybridization to occur 
between strands that are nonhomologous, and as a result, background decreases. If the 
radiolabeled probe is not completely homologous with the immobilized fragment (as is 
frequently the case in gene family and interspecies hybridization experiments), the 
hybridization temperature must be reduced, and background will increase. The temperature of 
the washes affects the intensity of the hybridizing band and the degree of background in a 
similar manner. The stringency of the washes is also increased with decreasing salt 
concentrations. 

In general, convenient hybridization temperatures in the presence of 50% formamide 
are 42 DC for a probe with is 95% to 100% homologous to the target fragment, 37 DC for 90% 
to 95% homology, and 32 DC for 85% to 90% homology. For lower homologies, formamide 
content should be lowered and temperature adjusted accordingly, using the equation above. If 
the homology between the probe and the target fragment are not known, the simplest 
approach is to start with both hybridization and wash conditions which are nonstringent. If 
non-specific bands or high background are observed after autoradiography, the filter can be 
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washed at high stringency and reexposed. If the time required for exposure makes this 
approach impractical, several hybridization and/or washing stringencies should be tested in 
parallel. 

Nucleic Acid Probe Assays 

Methods such as PCR, branched DNA probe assays, or blotting techniques utilizing 
nucleic acid probes according to the invention can determine the presence of cDNA or 
mRNA. A probe is said to "hybridize" with a sequence of the invention if it can form a duplex 
or double stranded complex, which is stable enough to be detected. 

The nucleic acid probes will hybridize to the Neisserial nucleotide sequences of the 
invention (including both sense and antisense strands). Though many different nucleotide 
sequences will encode the amino acid sequence, the native Neisserial sequence is preferred 
because it is the actual sequence present in cells. mRNA represents a coding sequence and so 
a probe should be complementary to the coding sequence; single-stranded cDNA is 
complementary to mRNA, and so a cDNA probe should be complementary to the non-coding 
sequence. 

The probe sequence need not be identical to the Neisserial sequence (or its 
complement) — some variation in the sequence and length can lead to increased assay 
sensitivity if the nucleic acid probe can form a duplex with target nucleotides, which can be 
detected. Also, the nucleic acid probe can include additional nucleotides to stabilize the 
formed duplex. Additional Neisserial sequence may also be helpful as a label to detect the 
formed duplex. For example, a non-complementary nucleotide sequence may be attached to 
the 5' end of the probe, with the remainder of the probe sequence being complementary to a 
Neisserial sequence. Alternatively, non-complementary bases or longer sequences can be 
interspersed into the probe, provided that the probe sequence has sufficient complementarity 
with the a Neisserial sequence in order to hybridize therewith and thereby form a duplex 
which can be detected. 

The exact length and sequence of the probe will depend on the hybridization 
conditions, such as temperature, salt condition and the like. For example, for diagnostic 
applications, depending on the complexity of the analyte sequence, the nucleic acid probe 
typically contains at least 10-20 nucleotides, preferably 15-25, and more preferably at least 30 
nucleotides, although it may be shorter than this. Short primers generally require cooler 
temperatures to form sufficiently stable hybrid complexes with the template. 



WO 99/57280 



51 



PCT/US99/09346 



Probes may be produced by synthetic procedures, such as the triester method of 
Matteucci et al [J. Am. Chem. Soc. (1981) 103:3185], or according to Urdea et al [Proc. 
Natl. Acad. ScL USA (1983) 80: 7461], or using commercially available automated 
oligonucleotide synthesizers. 

The chemical nature of the probe can be selected according to preference. For certain 
applications, DNA or RNA are appropriate. For other applications, modifications may be 
incorporated eg. backbone modifications, such as phosphorothioates or methylphosphonates, 
can be used to increase in vivo half-life, alter RNA affinity, increase nuclease resistance etc. 
[eg. see Agrawal & Iyer (1995) Curr Opin Biotechnol 6:12-19; Agrawal (1996) TIBTECH 
14:376-387]; analogues such as peptide nucleic acids may also be used [eg. see Corey (1997) 
TIBTECH 15:224-229; Buchardt et al (1993) TIBTECH 1 1:384-386]. 

One example of a nucleotide hybridization assay is described by Urdea et al. in 
international patent application WO92/02526 [see also US patent 5,124,246]. 

Alternatively, the polymerase chain reaction (PCR) is another well-known means for 
detecting small amounts of target nucleic acids. The assay is described in: Mullis et al [Meth. 
Enzymol (1987) 155: 335-350]; US patent 4,683,195; and US patent 4,683,202. Two 
"primer" nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can comprise sequence that does not hybridize to the sequence of the 
amplification target (or its complement) to aid with duplex stability or, for example, to 
incorporate a convenient restriction site. Typically, such sequence will flank the desired 
Neisserial sequence. 

A thermostable polymerase creates copies of target nucleic acids from the primers 
using the original target nucleic acids as a template. After a threshold amount of target nucleic 
acids are generated by the polymerase, they can be detected by more traditional methods, such 
as Southern blots. When using the Southern blot method, the labelled probe will hybridize to 
the Neisserial sequence (or its complement). 

Also, mRNA or cDNA can be detected by traditional blotting techniques described in 
Sambrook et al [supra]. mRNA, or cDNA generated from mRNA using a polymerase 
enzyme, can be purified and separated using gel electrophoresis. The nucleic acids on the gel 
are then blotted onto a solid support, such as nitrocellulose. The solid support is exposed to a 
labelled probe and then washed to remove any unhybridized probe. Next, the duplexes 
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containing the labeled probe are detected. Typically, the probe is labelled with a radioactive 
moiety. 

EXAMPLES 

The examples describe nucleic acid sequences which have been identified in 
N. meningitidis, and K gonorrhoeae along with their respective and putative translation 
products. Not all of the nucleic acid sequences are complete ie. they encode less than the full- 
length wild-type protein. 

The examples are generally in the following format: 

• a nucleotide sequence which has been identified in N. meningitidis 

• the putative translation product of said N. meningitidis sequence 

• a computer analysis of said translation product based on database comparisons 

• a corresponding nucleotide sequence identified from N. gonorrhoeae 

• the putative translation product of said N. gonorrhoeae sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N, gonorrhoeae sequence. 

• a corresponding nucleotide sequence identified from strain A of N. meningitidis 

• the putative translation product of said N. meningitidis strain A sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a description of the characteristics of the protein which indicates that it might be 
suitably antigenic or immunogenic. 

Sequence comparisons were performed at NCBI (http://www.ncbi.nlm.nih.gov) using 
the algorithms BLAST, BLAST2, BLASTn, BLASTp, tBLASTn, BLASTx, & tBLASTx [eg 
see also Altschul et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs. Nucleic Acids Research 25:2289-3402]. Searches were performed 
against the following databases: non-redundant GenBank+EMBL+DDBJ+PDB sequences 
and non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR sequences. 

Dots within nucleotide sequences represent nucleotides which have been arbitrarily 
introduced in order to maintain a reading frame. In the same way, double-underlined 
nucleotides were removed . Lower case letters represent ambiguities which arose during 
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alignment of independent sequencing reactions (some of the nucleotide sequences in the 
examples are derived from combining the results of two or more experiments). 

Nucleotide sequences were scanned in all six reading frames to predict the presence of 
hydrophobic domains using an algorithm based on the statistical studies of Esposti et ah 
[Critical evaluation of the hydropathy of membrane proteins (1990) Eur J Biochem 190:207- 
219]. These domains represent potential transmembrane regions or hydrophobic leader 
sequences. 

Open reading frames were predicted from fragmented nucleotide sequences using the 
program ORFFINDER (NCBI). 

Underlined amino acid sequences indicate possible transmembrane domains or leader 
sequences in the ORFs, as predicted by the PSORT algorithm (http://www.psort.nibb.ac.jp). 
Functional domains were also predicted using the MOTIFS program (GCG Wisconsin & 
PROSITE). 

For each of the following examples: based on the presence of a putative leader 
sequence and/or several putative transmembrane domains (single-underlined) in the 
gonococcal protein, it is predicted that the proteins from N. meningitidis and N. gonorrhoeae, 
and their respective epitopes, could be useful antigens or immunogenic compositions for 
vaccines or diagnostics. 

The standard techniques and procedures which may be employed in order to perform 
the invention {e.g. to utilize the disclosed sequences for vaccination or diagnostic purposes) 
were summarized above. This summary is not a limitation on the invention but, rather, gives 
examples that may be used, but are not required. 

In particular, the following methods were used to express, purify and biochemically 
characterize the proteins of the invention. 
Chromosomal DNA Preparation 

N. meningitidis strain 2996 was grown to exponential phase in 100ml of GC medium, 
harvested by centrifugation, and resuspended in 5ml buffer (20%(w/v) Sucrose, 50mM Tris- 
HC1, 50mM EDTA, pH8). After 10 minutes incubation on ice, the bacteria were lysed by 
adding 10ml of lysis solution (50mM NaCl, 1% Na-Sarkosyl, 50jig/ml Proteinase K), and the 
suspension incubated at 37°C for 2 hours. Two phenol extractions (equilibrated to pH 8) and 
one CHCl 3 /isoamylalcohol (24:1) extraction were performed. DNA was precipitated by 
addition of 0.3M sodium acetate and 2 volumes of ethanol, and collected by centrifugation. 
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The pellet was washed once with 70%(v/v) ethanol and redissolved in 4.0ml TE buffer 
(lOmM Tris-HCl, ImM EDTA, pH 8.0). The DNA concentration was measured by reading 
the OD at 260 nm. 

Oligonucleotide design 

Synthetic oligonucleotide primers were designed on the basis of the coding sequence 
of each ORF, using (a) the meningococcus B sequence when available, or (b) the 
gonococcus/meningococcus A sequence, adapted to the codon preference usage of 
meningococcus as necessary. Any predicted signal peptides were omitted, by designing the 5 5 
primers to sequence immediately downstream from the predicted leader sequence. 

For most ORFs, the 5' primers included two restriction enzyme recognition sites 
(BamHl-Ndel, BamHl-Nhel, EcoRl-Ndel or EcoRl-Nhel\ depending on the restriction pattern 
of the gene of interest. The 3 ' primers included a Xhol or a Hindlll restriction site (table 1). 
This procedure was established in order to direct the cloning of each amplification product 
(corresponding to each ORF) into two different expression systems: pGEX-KG (using 
BamHl-Xhol, BamHl-Hindlll, EcoRl-Xhol or EcoRl-Hindlll), and pET2 lb+ (using Ndel- 
Xhol, Nhel-Xhol, Ndel-Hindlll or Nhel-Hindlll). 

5 '-end primer tail: CGC GGATCCCATATG (BamHl-Ndel ) 

C GC GGATCCGCTAGC (BamHl-Nhel) 
CCG GAATTC T ACATATG (EcoRl-Ndel) 
CCG GAATTC T AGCTAGC (EcoRl-Nhel) 

3'-end primer tail: CCCGCTCGAG (Xhol) 

CCCG CTCGAG (HindUl) 

For cloning ORFs into the pGEX-His vector, the 5' and V primers contained only one 
restriction enzyme site (EcoRI, Kpnl or Sail for the 5' primers and Pstl, Xbal, Sphl or Sail for 
the V primers). Again restriction sites were chosen according to the particular restriction 
pattern of the gene (table 1). 

5'-end primer tail: ( AAA) AA AGAATTC (EcoRl ) 

( AAA ) AAA GGTACC (Kpnl) 
3'-end primer tail: (AAA) AAA CTGCAG (Pstl) 

( AAA ) AAA TCTAGA (Xbal) 
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AAAGCATGC 



(Sphl) 



5' or 3'-end primer tail: 



AAAAAAGTCGAC 



(Sail) 



As well as containing the restriction enzyme recognition sequences, the primers 
included nucleotides which hybridized to the sequence to be amplified. The melting 
temperature depended on the number and type of hybridising nucleotides in the whole primer, 
and was determined for each primer using the formulae: 



T m = 64.9 + 0.41 (% GC) - 600/N (whole primer) 

The melting temperatures of the selected oligonucleotides were usually 65-70°C for 
the whole oligo and 50-55°C for the hybridising region alone. 

Table 1 shows the forward and reverse primers used for each amplification. In certain 
cases, the sequence of the primer does not exactly match the sequence of the predicted ORF. 
This is because when initial amplifications were performed, the complete 5* and/or 3' 
sequences for some meningococcal B ORFs were not known. However the corresponding 
sequences had been identified in Gonococcus or in Meningoccus A. Hence, when the 
Meningoccus B sequence was incomplete or uncertain, Gonococcal or Meningococcal A 
sequences were used as the basis for primer design. These sequences were altered to take 
account of codon preference. It can be appreciated that, once the complete sequence is 
identified, this approach will no longer be necessary. 

Oligonucleotides were synthesized using a Perkin Elmer 394 DNA/RNA Synthesizer, 
eluted from the columns in 2.0ml NH4OH, and deprotected by 5 hours incubation at 56°C. 
The oligos were precipitated by addition of 0.3M Na- Acetate and 2 volumes ethanol. The 
samples were centrifuged and the pellets resuspended in either 100p,l or 1.0ml of water. The 
OD260 was determined using a Perkin Elmer Lambda Bio spectophotometer and the 
concentration adjusted to 2-10pmol/jal. 

Amplification 

The standard PCR protocol was as follows: 50-200ng of genomic DNA was used as a 
template in the presence of 20-40fiM of each oligonucletide primer, 400-800|aM dNTPs 
solution, lx PCR buffer (including 1.5mM MgCh), 2.5 units TaqI DNA polymerase (using 



T m =4(G+C)+2(A+T) 



(tail excluded) 
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Perkin-Elmer AmpliTaQ, GIBCO Platinum, Pwo DNA polymerase, or Tahara Shuzo Taq 
polymerase). In some cases, PCR was optimsed by the addition of lOjil DMSO or 50)0.1 2M 
Betaine. 

After a hot start (adding the polymerase during a preliminary 3 minute incubation of 
the whole mix at 95°C), each sample underwent a two-step amplification. The first 5 cycles 
were performed using the hybridization temperature that excluded the restriction enzyme tail 
of the primer (see above). This was followed by 30 cycles using the hybridization temperature 
calculated for the whole length oligos. The cycles were completed with a 10 minute extension 
step at 72°C. The standard cycles were as follows: 





Denaturation 


Hybridisation 


Elongation 


First 5 cycles 


30 seconds 
95°C 


30 seconds 
50-55°C 


30-60 seconds 
72°C 


Last 30 cycles 


30 seconds 
95°C 


30 seconds 
65-70°C 


30-60 seconds 
72°C 



Elongation times varied according to the length of the ORF to be amplified. 
Amplifications were performed using either a 9600 or a 2400 Perkin Elmer GeneAmp PCR 
System. To check the results, 1/10 of the amplification volume was loaded onto a 1-1.5% 
(w/v) agarose gel and the size of each amplified fragment compared with a DNA molecular 
weight marker. 

The amplified DNA was either loaded directly on a 1% agarose gel or first 
precipitated with ethanol and resuspended in a volume suitable to be loaded on a 1.0% 
agarose gel. The DNA fragment corresponding to the band of correct size was purified using 
the Qiagen Gel Extraction Kit, following the manufacturer's protocol. DNA fragments were 
eluted in a volume of 30jxl or 50jal with either H20 or lOmM Tris, pH 8.5. 

Digestion of PCR fragments 

The purified DNA corresponding to the amplified fragment was doubly-digested with 
the appropriate restriction enzymes for; cloning into pET-21b-t- and expressing the protein as a 
C-terminus His-tagged fusion, for cloning into pGEX-KG and expressing the protein as a N- 
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terminus GST-fusion, and for cloning into pGEX-His and expressing the protein as a 
N-terminus GST-His tagged fusion. 

Each purified DNA fragment was incubated at 37°C for 3 hours to overnight with 20 
units of appropriate restriction enzyme (New England Biolabs) in a volume of either 30 or 
40[il in the presence of suitable digestion buffer. Digested fragments were purified using the 
QIAquick PGR purification kit (following the manufacturer's instructions) and eluted in a 
volume of 30^,1 or 50|al with either H20 or lOmM Tris, pH 8.5. The DNA concentration was 
determined by quantitative agarose gel electrophoresis (1.0% gel) in the presence of a titrated 
molecular weight marker. 

Digestion of the cloning vectors (pET22B, pGEX-KG, pTRC-His A, pET21b+, pGEX- 
KG, and pGEX-His) 

The vector pGEX-His is a modified pGEX-2T vector carrying a region encoding six 
histidine residues upstream of the thrombin cleavage site and containing the multiple cloning 
site of the vector pTRC99 (Pharmacia). 10 p,g plasmid was double-digested with 50 units of 
each restriction enzyme in 200 p,l reaction volume in the presence of appropriate buffer by 
overnight incubation at 37°C. After loading the whole digestion on a 1% agarose gel, the 
band corresponding to the digested vector was purified from the gel using the Qiagen 
QIAquick Gel Extraction Kit and the DNA was eluted in 50 pi of 10 mM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD 2 6o of the sample, and adjusted to 50 
^ig/]xl. 1 fal of plasmid was used for each cloning procedure. 

lOjig of plasmid vector was doubly-digested with 50 units of each restriction enzyme 
in a volume of 200jli1 with the appropriate buffer overnight at 37°C. The digest was loaded 
onto a 1 .0% agarose gel and the band corresponding to the digested vector purified using the 
Qiagen QIAquick Gel Extraction Kit. DNA was eluted in 50|li1 of lOmM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD 2 60nm and the concentration adjusted 
to 50jag/fil. 1 of plasmid was used for each cloning procedure. 

Cloning 

For some ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated in both pET22b and pGEX-KG. In a final volume of 20 (il, a molar 
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ratio of 3: 1 fragment/vector was ligated using 0.5 jal of NEB T4 DNA ligase (400 units/jal), in 
the presence of the buffer supplied by the manufacturer. The reaction was incubated at room 
temperature for 3 hours. In some experiments, ligation was performed using the Boheringer 
"Rapid Ligation Kit", following the manufacturer's instructions. 

In order to introduce the recombinant plasmid in a suitable strain, 100 yil E. coli DH5 
competent cells were incubated with the ligase reaction solution for 40 minutes on ice, then at 
37°C for 3 minutes, then, after adding 800 ^il LB broth, again at 37°C for 20 minutes. The 
cells were then centrifuged at maximum speed in an Eppendorf microfuge and resuspended in 
approximately 200 jal of the supernatant. The suspension was then plated on LB ampicillin 
(lOOmg/ml). 

The screening of the recombinant clones was performed by growing 5 
randomly-chosen colonies overnight at 37 °C in either 2 ml (pGEX or pTC clones) or 5ml 
(pET clones) LB broth +100 ^ig/ml ampicillin. The cells were then pelletted and the DNA 
extracted using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's 
instructions, to a final volume of 30 (al. 5 \il of each individual miniprep (approximately lg ) 
were digested with either NdeVXhol or BamHIIXhol and the whole digestion loaded onto a 1- 
1.5% agarose gel (depending on the expected insert size), in parallel with the molecular 
weight marker (1Kb DNA Ladder, GIBCO). The screening of the positive clones was made 
on the base of the correct insert size. 

For other ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated into both pET21b+ and pGEX-KG. A molar ratio of of 3: 1 
fragment/vector was used in a final volume of 20)il, that included 0.5|il T4 DNA ligase (400 
units/|al, NEB) and ligation buffer supplied by the manufacturer. The reaction was performed 
at room temperature for 3 hours. In some experiments, ligation was performed using the 
Boheringer "Rapid Ligation Kit" and the manufacturer's protocol. 

Recombinant plasmid was transformed into 100|xl of competent E. coli DH5 or 
HB101 by incubating the ligase reaction solution and bacteria for 40 minutes on ice then at 
37°C for 3 minutes. This was followed by the addition of 800|ul LB broth and incubation at 
37°C for 20 minutes. The cells were centrifuged at maximum speed in an Eppendorf 
microfuge, resuspended in approximately 200\x\ of the supernatant and plated onto LB 
ampicillin (lOOmg/ml ) agar. 
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Screening for recombinant clones was performed by growing 5 randomly selected 
colonies overnight at 37°C in either 2.0ml (pGEX-KG clones) or 5.0ml (pET clones) LB 
broth + 1 00|ag/ml ampicillin. Cells were pelleted and plasmid DNA extracted using the 
Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's instructions. 
Approximately lp,g of each individual miniprep was digested with the appropriate restriction 
enzymes and the digest loaded onto a 1-1.5% agarose gel (depending on the expected insert 
size), in parallel with the molecular weight marker (Ikb DNA Ladder, GIBCO). Positive 
clones were selected on the basis of the size of insert. 

ORFs were cloned into PGEX-His, by doubly-digesting the PCR product and ligating 
into similarly digested vector. After cloning, recombinant plasmids were transformed into the 
E.coli host W31 10. Individual clones were grown overnight at 37°C in LB broth with 50|xg/ml 
ampicillin. 

Certain ORFs may be cloned into the pGEX-HIS vector using EcoRl-Pstl cloning 
sites, or EcoRl-SaH, or Satl-Pstl, After cloning, the recombinant plasmids may be introduced 
in the^.coli host W3110. 

Expression 

Each ORF cloned into the expression vector may then be transformed into the strain 
suitable for expression of the recombinant protein product. 1 \i\ of each construct was used to 
transform 30 ^1 of E.coli BL21 (pGEX vector), E.coli TOP 10 (pTRC vector) or E.coli BL21- 
DE3 (pET vector), as described above. In the case of the pGEX-His vector, the same E.coli 
strain (W3 110) was used for initial cloning and expression. Single recombinant colonies were 
inoculated into 2ml LB-hAmp (100 jug/ml), incubated at 37°C overnight, then diluted 1 :30 in 
20 ml of LB+Amp (100 jig/ml) in 100 ml flasks, making sure that the ODeoo ranged between 
0.1 and 0.15. The flasks were incubated at 30°C into gyratory water bath shakers until OD 
indicated exponential growth suitable for induction of expression (0.4-0.8 OD for pET and 
pTRC vectors; 0.8-1 OD for pGEX and pGEX-His vectors). For the pET, pTRC and pGEX- 
His vectors, the protein expression was induced by addiction of ImM IPTG, whereas in the 
case of pGEX system the final concentration of IPTG was 0.2 mM. After 3 hours incubation 
at 30°C, the final concentration of the sample was checked by OD. In order to check 
expression, 1ml of each sample was removed, centrifuged in a microfuge, the pellet 
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resuspended in PBS, and analysed by 12% SDS-PAGE with Coomassie Blue staining. The 
whole sample was centrifuged at 6000g and the pellet resuspended in PBS for further use. 

GST-fusion proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on LB+Amp agar plate. 
The bacteria were inoculated into 20 ml of LB+Amp liquid colture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600 ml of fresh medium and allowed to 
grow at the optimal temperature (20-3 7°C) to OD 5 so 0.8-1 . Protein expression was induced 
with 0.2mM IPTG followed by three hours incubation. The culture was centrifuged at 8000 
rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 7.5 ml 
cold PBS. The cells were disrupted by sonication on ice for 30 sec at 40W using a Branson 
sonifier B-15, frozen and thawed two times and centrifuged again. The supernatant was 
collected and mixed with 150jj1 Glutatione-Sepharose 4B resin (Pharmacia) (previously 
washed with PBS) and incubated at room temperature for 30 minutes. The sample was 
centrifuged at 700g for 5 minutes at 4C. The resin was washed twice with 10 ml cold PBS for 
10 minutes, resuspended in 1ml cold PBS, and loaded on a disposable column. The resin was 
washed twice with 2ml cold PBS until the flow-through reached OD 2 8o of 0.02-0.06. The 
GST-fusion protein was eluted by addition of 700jj,1 cold Glutathione elution buffer 1 OmM 
reduced glutathione, 50mM Tris-HCl) and fractions collected until the OD280 was 0.1. 21 |il of 
each fraction were loaded on a 12% SDS gel using either Biorad SDS-PAGE Molecular 
weight standard broad range (Ml) (200, 1 16.25, 97.4, 66.2, 45, 31, 21.5, 14.4, 6.5 kDa) or 
Amersham Rainbow Marker (M") (220, 66, 46, 30, 21.5, 14.3 kDa) as standards. As the MW 
of GST is 26kDa, this value must be added to the MW of each GST- fusion protein. 

For other ORFs, for each clone to be purified as a GST-fusion, a single colony was 
streaked out and grown overnight at 37°C on a LB/Amp. (100|ag/ml) agar plate. An isolated 
colony from this plate was inoculated into 20ml of LB/Amp (100 ng/ml) liquid medium and 
grown overnight at 37°C with shaking. The overnight culture was diluted 1 :30 into 600ml 
LB/Amp (100|ag/ml) liquid medium and allowed to grow at the optimal temperature (20- 
37°C) until the ODssonm reached 0.6-0.8. Recombinant protein expression was induced by 
addition of IPTG (final concentration 0.2mM) and the culture incubated for a further 3 hours. 
Bacteria were harvested by centrifiigation at 8000xg for 15 min at 4°C. 
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The bacterial pellet was resuspended in 7.5ml cold PBS. Cells were disrupted by 
sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 and centrifuged at 
13 OOOxg for 30 min at 4°C. The supernatant was collected and mixed with 150^1 Glutatione- 
Sepharose 4B resin (Pharmacia), previously equilibrated with PBS, and incubated at room 
temperature with gentle agitation for 30 min. The batch-wise preparation was centrifuged at 
700xg for 5 min at 4°C and the supernatant discarded. The resin was washed twice 
(batchwise) with 10ml cold PBS for 10 min, resuspended in 1ml cold PBS, and loaded onto a 
disposable column. The resin continued to be washed with cold PBS, until the OD280nm of the 
flow-through reached 0.02-0.01. The GST-fusion protein was eluted by addition of 700jil cold 
glutathione elution buffer (lOmM reduced glutathione, 50mM Tris-HCl pH 8.0) and fractions 
collected, until the OD280nm of the eluate indicated all the recombinant protein was obtained. 
20(il aliquots of each elution fraction were analyzed by SDS-PAGE using a 12% gel. The 
molecular mass of the purified proteins was determined using either the Bio-Rad broad range 
molecular weight standard (Ml) (200, 116, 97.4, 66.2, 45.0, 31.0, 21.5, 14.4, 6.5 kDa) or the 
Amersham Rainbow Marker (M2) (220, 66.2, 46.0, 30.0, 21.5, 14.3 kDa). The molecular 
weights of GST- fusion proteins are a combination of the 26 kDa GST protein and its fusion 
partner. Protein concentrations were estimated using the Bradford assay. 

His-fusion soluble proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on a LB + Amp agar 
plate. The bacteria were inoculated into 20ml of LB+Amp liquid culture and incubated 
overnight in a water bath shaker. Bacteria were diluted 1 :30 into 600ml fresh medium and 
allowed to grow at the optimal temperature (20-37°C) to OD 550 0.6-0.8. Protein expression 
was induced by addition of 1 mM IPTG and the culture further incubated for three hours. The 
culture was centrifuged at 8000 rpm at 4°C, the supernatant was discarded and the bacterial 
pellet was resuspended in 7.5ml cold lOmM imidazole buffer (300 mM NaCl, 50 mM 
phosphate buffer, 10 mM imidazole, pH 8). The cells were disrupted by sonication on ice for 
30 sec at 40W using a Branson sonifier B-15, frozen and thawed two times and centrifuged 
again. The supernatant was collected and mixed with 1 50|il Ni 2+ -resin (Pharmacia) 
(previously washed with lOmM imidazole buffer) and incubated at room temperature with 
gentle agitation for 30 minutes. The sample was centrifuged at 700g for 5 minutes at 4°C. 
The resin was washed twice with 10 ml cold lOmM imidazole buffer for 10 minutes, 



WO 99/57280 



62 



PCT/US99/09346 



resuspended in 1ml cold lOmM imidazole buffer and loaded on a disposable column. The 
resin was washed at 4°C with 2ml cold lOmM imidazole buffer until the flow-through 
reached the O.D280 of 0.02-0.06. The resin was washed with 2ml cold 20mM imidazole 
buffer (300 mM NaCl, 50 mM phosphate buffer, 20 mM imidazole, pH 8) until the flow- 
through reached the O.D280 of 0.02-0.06. The His- fusion protein was eluted by addition of 
700^1 cold 250mM imidazole buffer (300 mM NaCl, 50 mM phosphate buffer, 250 mM 
imidazole, pH 8) and fractions collected until the O.D280 was 0.1. 21j^l of each fraction were 
loaded on a 12% SDS gel. 

His- fusion insoluble proteins large-scale purification. 

A single colony was grown overnight at 37 °C on a LB + Amp agar plate. The 
bacteria were inoculated into 20 ml of LB+Amp liquid culture in a water bath shaker and 
grown overnight. Bacteria were diluted 1:30 into 600ml fresh medium and let to grow at the 
optimal temperature (37°C) to O.D550 0.6-0.8. Protein expression was induced by addition 
of 1 mM IPTG and the culture further incubated for three hours. The culture was centrifiiged 
at 8000rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 
7.5 ml buffer B (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8). The cells 
were disrupted by sonication on ice for 30 sec at 40W using a Branson sonifier B-15, frozen 
and thawed twice and centrifuged again. The supernatant was stored at -20°C, while the 
pellets were resuspended in 2 ml guanidine buffer (6M guanidine hydrochloride, lOOmM 
phosphate buffer, 10 mM Tris-HCl, pH 7.5) and treated in a homogenizer for 10 cycles. The 
product was centrifuged at 13000 rpm for 40 minutes. The supernatant was mixed with 1 50(j.l 
Ni 2+ -resin (Pharmacia) (previously washed with buffer B) and incubated at room temperature 
with gentle agitation for 30 minutes. The sample was centrifuged at 700 g for 5 minutes at 
4°C. The resin was washed twice with 10 ml buffer B for 10 minutes, resuspended in 1ml 
buffer B, and loaded on a disposable column. The resin was washed at room temperature 
with 2ml buffer B until the flow-through reached the OD 2 80 of 0.02-0.06. The resin was 
washed with 2ml buffer C (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 6.3) until 
the flow-through reached the O.D280 of 0.02-0.06. The His-fusion protein was eluted by 
addition of 700^1 elution buffer (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 
4.5) and fractions collected until the OD 2 so was 0.1 . 21 jul of each fraction were loaded on a 
12% SDS gel. 
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Purification of His-fusion proteins. 

For each clone to be purified as a His-fusion, a single colony was streaked out and 
grown overnight at 37°C on a LB/Amp (100 fig/ml) agar plate. An isolated colony from this 
plate was inoculated into 20ml of LB/Amp (100 fig/ml) liquid medium and grown overnight 
at 37°C with shaking. The overnight culture was diluted 1 :30 into 600ml LB/Amp (100 
Hg/ml) liquid medium and allowed to grow at the optimal temperature (20-37°C) until the 
OD 55 onm reached 0.6-0.8. Expression of recombinant protein was induced by addition of IPTG 
(final concentration 1 .OmM) and the culture incubated for a further 3 hours. Bacteria were 
harvested by centrifiigation at 8000xg for 15 min at 4°C. 

The bacterial pellet was resuspended in 7.5ml of either (i) cold buffer A (300mM 
NaCl, 50mM phosphate buffer, lOmM imidazole, pH 8.0) for soluble proteins or (ii) buffer B 
(8M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8) for insoluble proteins. Cells 
were disrupted by sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 
and centrifuged at 13 OOOxg for 30 min at 4°C. For insoluble proteins, pellets were 
resuspended in 2.0 ml buffer C (6M guanidine hydrochloride, lOOmM phosphate buffer, 
lOmM Tris-HCl, pH 7.5) and treated with a Dounce homogenizer for 10 cycles. The 
homogenate was centrifuged at 13 OOOxg for 40 min and the supernatant retained. 

Supernatants for both soluble and insoluble preparations were mixed with 150jal Ni 2+ - 
resin (previously equilibrated with either buffer A or buffer B, as appropriate) and incubated 
at room temperature with gentle agitation for 30 min. The resin was Chelating Sepharose Fast 
Flow (Pharmacia), prepared according to manufacturers protocol. The batch-wise preparation 
was centrifuged at 700xg for 5 min at 4°C and the supernatant discarded. The resin was 
washed twice (batch-wise) with 10ml buffer A or B for 10 min, resuspended in 1.0 ml buffer 
A or B and loaded onto a disposable column. The resin continued to be washed with either (i) 
buffer A at 4°C or (ii) buffer B at room temperature, until the OD 2 80nm of the flow-through 
reached 0.02-0.01. The resin was further washed with either (i) cold buffer C (300mM NaCl, 
50mM phosphate buffer, 20mM imidazole, pH 8.0) or (ii) buffer D (8M urea, lOmM Tris- 
HCl, lOOmM phosphate buffer, pH 6.3) until the the OD 2 8onm of the flow-through reached 
0.02-0.01. The His-fusion protein was eluted by addition of 700jal of either (i) cold elution 
buffer A (300mM NaCl, 50mM phosphate buffer, 250mM imidazole, pH 8.0) or (ii) elution 
buffer B (8 M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 4.5) and fractions 
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collected until the O.D2gonm indicated all the recombinant protein was obtained. 20jj.1 aliquots 
of each elution fraction were analyzed by SDS-PAGE using a 12% gel. Protein concentrations 
were estimated using the Bradford assay. 

His-fusion proteins renaturation 

In the cases where denaturation was required to solubilize proteins, a renaturation step 
was employed prior to immunization. Glycerol was added to the denatured fractions obtained 
above to give a final concentration of 10%(v/v). The proteins were diluted to 200(ig/ml using 
dialysis buffer I (10% (v/v) glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced 
glutathione, 0,5mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same 
buffer for 12-14 hours at 4°C. Further dialysis was performed with buffer II (10% (v/v) 
glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced glutathione, 0.5mM 
oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Alternatively, 10% glycerol was added to the denatured proteins. The proteins were 
then diluted to 20^g/ml using dialysis buffer I (10% glycerol, 0.5M arginine, 50mM 
phosphate buffer, 5mM reduced glutathione, 0.5mM oxidised glutathione, 2M urea, pH 8.8) 
and dialysed against the same buffer at 4°C for 12-14 hours. The protein was further dialysed 
against dialysis buffer II (10% glycerol, 0.5M arginine, 50mM phosphate buffer, 5mM 
reduced glutathione, 0.5mM oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Protein concentration was evaluated using the formula: 

Protein (mg/ml) = (1 .55 x OD 280 ) - (0.76 x OD 26 o) 

Purification of proteins 

To analyse the solubility, pellets obtained from 3.0ml cultures were resuspended in 
500^1 buffer Ml (PBS pH 7.2). 25fxl of lysozyme (lOmg/ml) was added and the bacteria 
incubated for 15 min at 4°C. Cells were disrupted by sonication on ice four times for 30 sec at 
40W using a Branson sonifier 450 and centrifuged at 13 OOOxg for 30 min at 4°C. The 
supernatant was collected and the pellet resuspended in buffer M2 [8M urea, 0.5M NaCl, 
20mM imidazole and 0.1M NaH 2 P0 4 ] and incubated for 3 to 4 hours at 4°C. After 
centrifugation, the supernatant was collected and the pellet resuspended in buffer M3 [6M 
guanidinium-HCl, 0.5M NaCl, 20mM imidazole and 0.1 M NaH 2 P0 4 ] overnight at 4°C. The 
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supernatants from all steps were analysed by SDS-PAGE. Some proteins were found to be 
soluble in PBS, others need urea or guanidium-HCl for solubilization. 

For preparative scale purifications, 500ml cultures were induced and fusion proteins 
solubilized in either buffer Ml, M2 or M3 using the procedure described above. Crude 
extracts were loaded onto a Ni-NTA superflow column (Quiagen) equilibrated with buffer 
Ml, M2 or M3 depending on the solubilization buffer employed. Unbound material was 
eluted by washing the column with the same buffer. The recombinant fusion protein was 
eluted with the corresponding buffer containing 500mM imidazole then dialysed against the 
same buffer in the absence of imidazole. 

Mice immunisations 

20^tg of each purified protein are used to immunise mice intraperitoneally. In the case 
of some ORFs, Balb-C mice were immunised with Al(OH) 3 as adjuvant on days 1, 21 and 42, 
and immune response was monitored in samples taken on day 56. For other ORFs, CD1 mice 
could be immunised using the same protocol. For ORFs 25 and 40, CD1 mice were 
immunised using Freund's adjuvant, and the same immunisation protocol was used, except 
that the immune response was measured on day 42, rather than 56. Similarly, for still other 
ORFs, CD1 mice were immunised with Freund's adjuvant, but the immune response was 
measured on day 49. Alternatively, 20[ig of each purified protein was mixed with Freund's 
adjuvant and used to immunise CD1 mice intraperitoneally. For many of the proteins, the 
immunization was performed on days 1,21 and 35, and immune response was monitored in 
samples taken on days 34 and 49. For some proteins, the third immunization was performed 
on day 28, rather than 35, and the immune response was measured on days 20 and 42, rather 
than 34 and 49. 

ELISA assay (sera analysis) 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 7ml of Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD620- The 
bacteria were let to grow until the OD reached the value of 0.3-0.4. The culture was 
centrifuged for 10 minutes at 10000 rpm. The supernatant was discarded and bacteria were 
washed once with PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated 
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for 2 hours at room temperature and then overnight at 4°C with stirring. 100^1 bacterial cells 
were added to each well of a 96 well Greiner plate and incubated overnight at 4°C. The wells 
were then washed three times with PBT washing buffer (0.1% Tween-20 in PBS). 200 jjJ of 
saturation buffer (2.7% Polyvinylpyrrolidone 10 in water) was added to each well and the 
plates incubated for 2 hours at 37°C. Wells were washed three times with PBT. 200 jal of 
diluted sera (Dilution buffer: 1% BSA, 0.1% Tween-20, 0.1% NaN 3 in PBS) were added to 
each well and the plates incubated for 90 minutes at 37°C. Wells were washed three times 
with PBT. 100 \i\ of HRP-conjugated rabbit anti-mouse (Dako) serum diluted 1 :2000 in 
dilution buffer were added to each well and the plates were incubated for 90 minutes at 37°C. 
Wells were washed three times with PBT buffer. 100 jal of substrate buffer for HRP (25 ml of 
citrate buffer pH5, 10 mg of O-phenildiamine and 10 jlxI of H 2 0) were added to each well and 
the plates were left at room temperature for 20 minutes. 100 jil H2SO4 was added to each 
well and OD490 was followed. The ELISA was considered positive when OD490 was 2.5 
times the respective pre-immune sera. 

Alternatively, The acapsulated MenB M7 strain was plated on chocolate agar plates 
and incubated overnight at 37°C. Bacterial colonies were collected from the agar plates using 
a sterile dracon swab and inoculated into Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD620. The bacteria 
were let to grow until the OD reached the value of 0.3-0.4. The culture was centrifuged for 10 
minutes at 10 OOOrpm. The supernatant was discarded and bacteria were washed once with 
PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated for 1 hour at 37°C 
and then overnight at 4°C with stirring. 1 OOjal bacterial cells were added to each well of a 96 
well Greiner plate and incubated overnight at 4°C. The wells were then washed three times 
with PBT washing buffer (0.1% Tween-20 in PBS). 200|nl of saturation buffer (2.7% 
Polyvinylpyrrolidone 10 in water) was added to each well and the plates incubated for 2 hours 
at 37°C. Wells were washed three times with PBT. 200jal of diluted sera (Dilution buffer: 1% 
BSA, 0.1% Tween-20, 0.1% NaN 3 in PBS) were added to each well and the plates incubated 
for 2 hours at 37°C Wells were washed three times with PBT. lOOjxl of HRP-conjugated 
rabbit anti-mouse (Dako) serum diluted 1:2000 in dilution buffer were added to each well and 
the plates were incubated for 90 minutes at 37°C. Wells were washed three times with PBT 
buffer. \00\il of substrate buffer for HRP (25ml of citrate buffer pH5, lOmg of O- 
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phenildiamine and 10/al of H2O2) were added to each well and the plates were left at room 
temperature for 20 minutes. lOOjal of 12.5% H2SO4 was added to each well and OD490 was 
followed. The ELISA titers were calculated abitrarely as the dilution of sera which gave an 
OD490 value of 0.4 above the level of preimmune sera. The ELISA was considered positive 
when the dilution of sera with OD490 of 0.4 was higher than 1 :400. 

FACScan bacteria Binding Assay procedure. 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 4 tubes containing 8ml each Mueller-Hinton Broth (Difco) 
containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following 
ODe20- The bacteria were let to grow until the OD reached the value of 0.35-0.5. The culture 
was centrifuged for 10 minutes at 4000rpm. The supernatant was discarded and the pellet was 
resuspended in blocking buffer (1% BSA in PBS, 0.4% NaN3) and centrifuged for 5 minutes 
at 4000rpm. Cells were resuspended in blocking buffer to reach OD620 of 0.07. 100p,l bacterial 
cells were added to each well of a Costar 96 well plate. 100p.l of diluted (1 :100, 1:200, 1 :400) 
sera (in blocking buffer) were added to each well and plates incubated for 2 hours at 4°C. 
Cells were centrifuged for 5 minutes at 4000rpm, the supernatant aspirated and cells washed 
by addition of 200|^l/well of blocking buffer in each well. lOOjal of R-Phicoerytrin conjugated 
F(ab)2 goat anti -mouse, diluted 1 :100, was added to each well and plates incubated for 1 hour 
at 4°C. Cells were spun down by centrifiigation at 4000rpm for 5 minutes and washed by 
addition of 200(il/well of blocking buffer. The supernatant was aspirated and cells 
resuspended in 200jxl/well of PBS, 0.25% formaldehyde. Samples were transferred to 
FACScan tubes and read. The condition for FACScan (Laser Power 15mW) setting were: FL2 
on; FSC-H threshold:92; FSC PMT Voltage: E 01; SSC PMT: 474; Amp. Gains 6.1; FL-2 
PMT: 586; compensation values: 0. 

OMV preparations 

Bacteria were grown overnight on 5 GC plates, harvested with a loop and resuspended 
in 10 ml 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30 minutes and the 
bacteria disrupted by sonication for 10' on ice ( 50% duty cycle, 50% output ). Unbroken 
cells were removed by centrifiigation at SOOOg for 10 minutes and the total cell envelope 
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fraction recovered by centrifugation at 50000g at 4°C for 75 minutes. To extract cytoplasmic 
membrane proteins from the crude outer membranes, the whole fraction was resuspended in 
2% sarkosyl (Sigma) and incubated at room temperature for 20 minutes. The suspension was 
centrifuged at lOOOOg for 10 minutes to remove aggregates, and the supernatant further 
ultracentrifuged at 50000g for 75 minutes to pellet the outer membranes. The outer 
membranes were resuspended in lOmM Tris-HCl, pH8 and the protein concentration 
measured by the Bio-Rad Protein assay, using BSA as a standard. 

Whole Extracts preparation 

Bacteria were grown overnight on a GC plate, harvested with a loop and resuspended 
in 1ml of 20mM Tris-HCI. Heat inactivation was performed at 56°C for 30' minutes. 

Western blotting 

Purified proteins (500ng/lane), outer membrane vesicles (5p.g) and total cell extracts 
(25|ig) derived from MenB strain 2996 were loaded onto a 12% SDS-polyacrylamide gel and 
transferred to a nitrocellulose membrane. The transfer was performed for 2 hours at 150mA at 
4°C, using transfer buffer (0.3 % Tris base, 1 .44 % glycine, 20% (v/v) methanol). The 
membrane was saturated by overnight incubation at 4°C in saturation buffer (10% skimmed 
milk, 0.1% Triton XI 00 in PBS). The membrane was washed twice with washing buffer (3% 
skimmed milk, 0.1% Triton XI 00 in PBS) and incubated for 2 hours at 37°C with mice sera 
diluted 1 :200 in washing buffer. The membrane was washed twice and incubated for 90 
minutes with a 1 :2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The 
membrane was washed twice with 0.1% Triton X100 in PBS and developed with the Opti- 
4CN Substrate Kit (Bio-Rad). The reaction was stopped by adding water. 

Bactericidal assay 

MC58 and 2996 strains were grown overnight at 37°C on chocolate agar plates. 5-7 
colonies were collected and used to inoculate 7ml Mueller-Hinton broth. The suspension was 
incubated at 37°C on a nutator and let to grow until OD 6 2o was in between 0.5-0.8. The 
culture was aliquoted into sterile 1.5ml Eppendorf tubes and centrifuged for 20 minutes at 
maximum speed in a microfuge. The pellet was washed once in Gey's buffer (Gibco) and 
resuspended in the same buffer to an OD 620 of 0.5, diluted 1 :20000 in Gey's buffer and stored 
at 25°C 



WO 99/57280 



69 



PCT/US99/09346 



50^1 of Gey's buffer/1% BSA was added to each well of a 96-well tissue culture plate. 
25|il of diluted (1 : 100) mice sera (dilution buffer: Gey's buffer/0.2% BSA) were added to 
each well and the plate incubated at 4°C. 25|xl of the previously described bacterial 
suspension were added to each well. 25^1 of either heat-inactivated (56°C waterbath for 30 
minutes) or normal baby rabbit complement were added to each well. Immediately after the 
addition of the baby rabbit complement, 22jal of each sample/well were plated on Mueller- 
Hinton agar plates (time 0). The 96-well plate was incubated for 1 hour at 37°C with rotation 
and then 22|il of each sample/well were plated on Mueller-Hinton agar plates (time 1). After 
overnight incubation the colonies corresponding to time 0 and time lh were counted. 

Gene Variability 

The ORF4 and 91 9 genes were amplified by PCR on chromosomal DNA extracted 
from various Neisseria strains (see list of strains). The following oligonucleotides used as 
PCR primers were designed in the upstream and downstream regions of the genes: 

orf 4.1 (forward) CGAATCCGGACGGCAGGACTC 

orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 

919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 
919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

The PCR cycling conditions were: 

1 cycle 2 min. at 94° 

30 cycles 30 sec. at 94° 

30 sec. at ~ 54° or ~ 60° ( in according to Tm of the primers) 

40 sec. at 72° 
1 cycle 7 min. at 72° 

The PCR products were purified from 1 % agarose gel and sequenced using the 
following primers: 

orf 4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf 4.2 (forward) CGACCGCGCCTTTGGGACTG 
orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 
orf 4.4 (reverse) TCTTTGAGTTTGATCCAACC 
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919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 

919.2 (forward) ATCCTTCCGCCTCGGCTGCG 

919.3 (forward) AAAACAGCGGCACAATCGAC 

919.4 (forward) ATAAGGGCTACCTCAAACTC 
919.5. (forward) GCGCGTGGATTATTTTTGGG 

919.6 (reverse) C AAAT AAG AAAGG AAT T T T G or 

GGTATCGCAAAACTTCGCCTTAATGCG 

919.7 (reverse) CCCAAGGTAATGTAGTGCCG 

919.8 (reverse) TAAAAAAAAGTTCGACAGGG 

919.9 (reverse) CCGTCCGCCTGTCGTCGCCC 

919.10 (reverse) TCGTTCCGGCGGGGTCGGGG 

All documents cited herein are incorporated by reference in their entireties. 
The following Examples are presented to illustrate, not limit, the invention 

EXAMPLE 1 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 1: Oligonucleotides used for PCR for Examples 2-10 



ORF 


Primer 


Sequence 


Restriction sites 


279 


Forward 
Reverse 


CGCGGATCCCATATG-TTGCCTGCAATCACGATT 
<SEQ ID 3021 > 

CCCGCTCGAG-TTTAGAAGCGGGCGGCAA <SEQ 
ID 3022> 


BamHI-Ndel 
Xhol 


519 


Forward 
Reverse 


CGCGGATCCCATATG-TTCAAATCCTTTGTCGTCA 
<SEQ ID 3023> 

CCCGCTCGAG-TTTGGCGG I I I I GCTGC <SEQ ID 
3024> 


BamHI-Ndel 
Xhol 


576 


Forward 
Reverse 


CGCGGATCCCATATG-GCCGCCCCCGCATCT 
<SEQ ID 3025> 

CCCGCTCGAG-ATTTAC I I I I I I GATGTCGAC 
<SEQ ID 3026> 


BamHI-Ndel 
Xhol 


919 


Forward 
Reverse 


CGCGGATCCCATATG-TGCCAAAGCAAGAGCATC 
<SEQ ID 3027> 

CCCGCTCGAG-CGGGCGGTATTCGGG <SEO ID 
3028> 


BamHI-Ndel 
Xhol 


121 


Forward 


CGCGGATCCCATATG-GAAACACAGCTTTACAT 
<SEQ ID 3029> 


BamHI-Ndel 
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Reverse 


CCCGCTCGAG-ATAATAATATCCCGCGCCC <SEQ 
ID 3030> 


Xhol 


128 


Forward 
Reverse 


CGCGGATCCCATATG-ACTGACAACGCACT <SEQ 
ID 3031 > 

CCCGCTCGAG-GACCGCGTTGTCGAAA <SEQ ID 
3032> 


BamHI-Ndel 
Xhol 


206 


Forward 
Reverse 


CGCGGATCCCATATG-AAACACCGCCAACCGA 
<SEQ ID 3033> 

CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC 
<SEQ ID 3034> 


BamHI-Ndel 
Xhol 


287 


Forward 
Reverse 


CCGGAATTCTAGCTAGC-CTTTCAGCCTGCGGG 
<SEQ ID 3035> 

CCCGCTCGAG-ATCCTGCTC I MM IGCC <SEQ ID 
3036> 


EcoRI-Nhel 
Xhol 


406 


Forward 
Reverse 


CGCGGATCCCATATG-TGCGGGACACTGACAG 
<SEQ ID 3037> 

CCCGCTCGAG-AGGTTGTCCTTGTCTATG <SEQ 
ID 3038> 


BamHI-Ndel 
Xhol 



Localization of the ORFs 

The following DNA and amino acid sequences are identified by titles of the following 
form: [g, m, or a] [#].[seq or pep], where "g" means a sequence from N. gonorrhoeae, "m" 
means a sequence from N. meningitidis and "a" means a sequence from N. meningitidis A; 
"#" means the number of the sequence; "seq" means a DNA sequence, and "pep" means an 
amino acid sequence. For example, "gOOl.seq" refers to an N. gonorrohoeae DNA sequence, 
number 1. The presence of the suffix "-1" to these sequences indicates an additional sequence 
found for the same ORF, thus, data for an ORF having both an unsuffixed and a suffixed 
sequence designation applies to both such designated sequences. Further, open reading 
frames are identified as ORF #, where "#" means the number of the ORF, corresponding to 
the number of the sequence which encodes the ORF, and the ORF designations may be 
suffixed with ".ng ,f or ".a", indicating that the ORF corresponds to a K gonorrhoeae sequence 
or a N. meningitidis A sequence, respectively. The word "partial" before a sequence indicates 
that the sequence may be a partial or a complete ORF. Computer analysis was performed for 
the comparisons that follow between "g", "m M , and "a" peptide sequences; and therein the 
"pep" suffix is implied where not expressly stated. Further, in the event of a conflict between 
the text immediately preceding and describing which sequences are being compared, and the 
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designated sequences being compared, the designated sequence controls and is the actual 
sequence being compared. 



ORF; contig: 

279 gnm4.seq 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 303 9>: 
m279.seq 

1 ATAACGCGGA TTTGCGGCTG CTTGATTTCA ACGGTTTTCA GGGCTTCGGC 

51 AAGTTTGTCG GCGGCGGGTT TCATCAGGCT GCAATGGGAA GGTACGGACA 

101 CGGGCAGCGG CAGGGCGCGT TTGGCACCGG CTTCTTTGGC GGCAGCCATG 

151 GCGCGTCCGA CGGCGGCGGC GTTGCCTGCA ATCACGATTT GTCCGGGTGA 

201 GTTGAAGTTG ACGGCTTCGA CCACTTCGCT TTGGGCGGCT TCGGCACAAA 

251 TGGCTTTAAC CTGCTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCAT CAGT TCGGCGCGCA GGCGCACGAG 

351 TTTGACCGCG TCGGCAAAAT TCAATGCGCC GGCGG CAACG AGTGCGGTGT 

401 ATTCGCCGAG GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

451 TCTAAATAG 

This corresponds to the amino acid sequence <SEQ ID 3040; ORF 279>: 
m279.pep 

1 ITRICGCLIS TVFRASASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 

51 ARPTAAA LPA ITICPGELKL TASTTSLWAA SAQMALTCSS SKPRIAAIAP 

101 TPCGTADCIS SARRRTSLTA SAKFNAPAAT SAVYSPRLCP ATAAGVLPPA 

151 SK* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3041>: 

g279 -seq 

1 atgacgcgga tttgcggctg cttgatttca acggttttga gtgtttcggc 

51 aagtttgtcg gcggcgggtt tcatcaggct gcaatgggaa ggaacggata 

101 ccggcagcgg cagggcgcgt ttggctccgg cttctttggc ggcagccatg 

151 gtgcgtccga cggcggcggc gttgcctgca atcacgactt gtccgggcga 

201 gttgaagttg acggcttcga ccacttcgcc ctgtgcggat tcggcacaaa 

251 tctgcctgac ctgttcatct tccaaaccca aaatggccgc cattgcgcct 

3 01 acgccttgcg gtacggcgga ctgcatcagt tcggcgcgca ggcggacgag 

351 tttgacggca tcggcaaaat ccaatgcttc ggcggcgaca agcgcggtgt 

401 attcgccgag gctgtgtccg gcaacggcgg caggcgtttt gccgcccact 

451 tccaaatag 

This corresponds to the amino acid sequence <SEQ ID 3042; ORF 279.ng>: 

g279 -pep 

1 MTRICGCLIS TVLSVSASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 
51 VRPTAAA L PA ITTCPGELKL TASTTSPCAD SAQICLTCSS SKPKMAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKSNASAAT SAVYSPRLCP ATAAGVLPPT 
151 SK* 

ORF 279 shows 89.5% identity over a 152 aa overlap with a predicted ORF (ORF 279.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

m279 . pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

MINIMUM: MMIltlllMIIIIIIIIIIMIIMIIIIIIIhimiMM 

g2 79 MTR I CG CL I S TVLS VS AS L S AAGF I RLQWEGTDTG S GRARLAP AS LAAAMVR P TAAAL PA 

10 20 30 40 50 60 
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70 80 90 100 110 120 

TT1279 . pep I T I C PGE LKLT AS T T S LW AAS AQMALT CSSSKPRI AA I AP T P CGT ADC I S S ARRRT S LT A 

II IIIIMIIIIIII I 111= I 1 I I I I 1 ( = = I i I ( 1 I 1 I ! I I 1 I M I I ( I i I 1 I I I 
g2 7 9 I TTCPGELKLTASTTS PCADSAQ I CLTCS S SKPKMAA I APTPCGTADCI SS ARRRTSLTA 

70 80 90 100 110 120 



130 140 150 

m279 . pep S AKFNAP AAT S AV Y S PRL.C P AT AAGVLP PAS KX 

II! II IMIIIIIIIIMIIIIIIIIhlll 

g279 SAKSNASAATSAVYSPRLCPATAAGVLPPTSKX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3043>: 

a279.seq 

1 ATGACNCNGA TTTGCGGCTG CTTGATTTCA ACGGTTTNNA GGGCTTCGGC 

51 GAGTTTGTCG GCGGCGGGTT TCATGAGGCT GCAATGGGAA GGTACNGACA 

101 CNGGCAGCGG CAGGGCGCGT TTGGCGCCGG CTTCTTTGGC GGCAAGCATA 

151 GCGCGCTCGA CGGCGGCGGC ATTGCCTGCA ATCACGACTT GTCCGGGCGA 

201 GTTGAAGTTG ACGGCTTCAA CCACTTCATC CTGTGCGGAT TCGGCGCAAA 

251 TTTGTTTTAC CTGTTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 

301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA NGCGCACGAG 

351 TTTGACCGCG TCGGCAAAAT CCAATGCGCC GGCGGCAACN AGTGCGGTGT 

401 ATTCGCCGAN GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 

4 51 TCCGAATAG 

This corresponds to the amino acid sequence <SEQ ID 3044; ORF 279.a>: 

a27 9.pep 

1 MTXICGCLIS TVXRASASLS AAG FMRLQWE GTDTGSGRAR LAPASLAASI 
51 ARSTAAALPA ITTCPGELKL TASTTSSCAD SAQICFTCSS SKPRIAAIAP 
101 TPCGTADCIS SARXRTSLTA SAKSNAPAAT SAVYSPXLCP AT AAG VL P PA 
151 SE* 

m279/a279 ORFs 279 and 279.a showed a 88.2% identity in 152 aa overlap 

10 20 30 40 50 60 

m27 9.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 
: I I I M I I I I I I I I I I M I I 1 I : II I II I I i I I I I M I I I I I I I I I : : I I I I I M I I 
a27 9 MTX I CGCL I ST VXRAS AS LS AAG FMRLQWEGTDTGSGRARLAPAS LAAS I ARST AAAL PA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m27 9.pep I TICPGELKLTASTTSLW AAS AQMALTCSSSKPRIAAI APT PCGTADCISS ARRRTSLTA 

II I I M I ! II I M I I I III: : M I I I I I I I I I I I I I I I I II I I I I I I I MINI 
a27 9 ITTCPGELKLTASTTSSCADSAQICFTCSSSKPRIAAIAPTPCGTADCISSARXRTSLTA 

70 80 90 100 110 120 



130 140 150 

m2 7 9 . pep S AK FN A P AAT S A V Y S P RLC PAT AAG VL P PA S KX 

III MINIMUM I II I I I I I I I I I II : I 
a27 9 SAKSNAPAAT SAVYSPXLCPATAAGVLPPASEX 

130 140 150 



519 and 519-1 gnm7.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3045>: 

ra519.seq (partial) 

1 . . TCCGTTATCG GGCGTATGGA GTTGGACAAA ACGTTTGAAG AACGCGACGA 

51 AATCAACAGT ACTGTTGTTG CGGCTTTGGA CGAGGCGGCC GGGgCTTgGG 

101 GTGTGAAGGT TTTGCGTTAT GAGATTAAAG ACT TGGTT CC GCCG CAAGAA 

151 ATCCTTCGCT CAATG CAGGC GCAAATTACT GCCGAACGCG AAAAACGCGC 

201 CCGTATCGCC GAATCCGAAG GTCGTAAAAT CGAACAAATC AACCTTGCCA 
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251 GTGGTCAGCG CGAAGCCGAA 

3 01 GCGGTCAATG CGTCAAATGC 
351 AGGTGAAGCG GAATCCTTGC 

4 01 TCCGTCAAAT TGCCGCCGCC 
451 AATCTGAAGA TTGCGGAACA 
501 AGAAAGCAAT ACGCTGATTA 
551 TGATTTCTGC CGGTATGAAA 



ATCCAACAAT CCGAAGGCGA GGCTCAGGCT 
CGAGAAAATC GCCCGCATCA ACCGCGCCAA 
GCCTTGTTGC CGAAGCCAAT GCCGAAGCCA 
CTTCAAACCC AAGGCGGTGC GGATGCGGTC 
ATACGTCGCT GCGTT CAAC A ATCTTGCCAA 
TGCCCGCCAA TGTTGCCGAC ATCGGCAGCC 
ATTATCGACA GCAGCAAAAC CGCCAAaTAA 



This corresponds to the amino acid sequence <SEQ ID 3046; ORF 519>: 

m5l9 .pep (partial) 

1 . . SVIGRMELDK TFEERDEINS TWAALDEAA GAWGVKVLRY EIKDLVPPQE 

51 ILRSMQAQIT AEREKRARIA ESEGRKIEQI NLASGQREAE IQQSEGEAQA 

101 AVNASNAEKI ARINRAKGEA ESLRLVAEAN AEAIRQIAAA LQTQGGADAV 

151 NLKIAEQYVA AFNNLAKESN TLIMPANVAD IGSLISAGMK IIDSSKTAK* 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3047>: 

g519 . seq 

l atggaatttt tcattatctt gttggcagcc gtcgccgttt tcggcttcaa 

51 atcctttgtc gtcatccccc agcaggaagt ccacgttgtc gaaaggctcg 

101 ggcgtttcca tcgcgccctg acggccggtt tgaatatttt gattcccttt 

151 atcgaccgcg tcgcctaccg ccattcgctg aaagaaatcc ctttagacgt 

201 acccagccag gtctgcatca cgcgcgataa tacgcaattg actgttgacg 

251 gcatcatcta tttccaagta accgatccca aactcgcctc atacggttcg 

301 agcaactaca ttatggcaat tacccagctt gcccaaacga cgctgcgttc 

351 cgttatcggg cgtatggagt tggacaaaac gtttgaagaa cgcgacgaaa 

4 01 tcaacagtac cgtcgtctcc gccctcgatg aagccgccgg ggcttggggt 

4 51 gtgaaagtcc tccgttacga aatcaaggat ttggttccgc cgcaagaaat 

501 ccttcgcgca atgcaggcac aaattaccgc cgaacgcgaa aaacgcgccc 

551 gtattgccga atccgaaggc cgtaaaatcg aacaaatcaa ccttgccagt 

601 ggtcagcgtg aagccgaaat ccaacaatcc gaaggcgagg ctcaggctgc 

651 ggtcaatgcg tccaatgccg agaaaatcgc ccgcatcaac cgcgccaaag 

701 gcgaagcgga atccctgcgc cttgttgccg aagccaatgc cgaagccaac 

751 cgtcaaattg ccgccgccct tcaaacccaa agcggggcgg atgcggtcaa 

801 tctgaagatt gcgggacaat acgttaccgc gttcaaaaat cttgccaaag 

851 aagacaatac gcggattaag cccgccaagg ttgccgaaat cgggaaccct 

901 aattttcggc ggcatgaaaa attttcgcca gaagcaaaaa cggccaaata 

951 a 

This corresponds to the amino acid sequence <SEQ ID 3048; ORF 519.ng>: 

g519 .pep 

1 MEFFIILLAA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKI EQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAN 

251 RQIAAALQTQ SGADAVNLKI AGQYVTAFKN LAKEDNTRIK PAKVAEIGNP 

3 01 NFRRHEKFSP EAKTAK* 



ORF 519 shows 87.5% identity over a 200 aa overlap with a predicted ORF (ORF 519.ng) 
from N. gonorrhoeae: 

m519/g519 

10 20 30 

m519 . pep SVIGRMELDKTFEERDEINSTWAALDEAA 

I i 1 1 M II 1 M 1 1 M U 1 1 1 1 1 1 ^ 1 1 M II 

g519 YFQVTDPKLASYGSSNYIMAI TQLAQTTLRSVIGRMELDKTFEERDEINSTWSALDEAA 

90 100 110 120 130 140 



40 50 60 70 80 90 

m519 . pep GAWGVKVLRYE I KDIiVPPQEI LRSMQAQITAEREKRARI AESEGRKIEQINLASGQREAE 



WO 99/57280 



75 



PCT/US99/09346 



II I M I I I I M I I I I I II M I I h I I I I I !! M I I I I I I I ! I I I i I ! II I I; I || II I M 

g519 GAWGVKVLRYEIKDLVPPQEILRAMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
150 160 170 180 190 200 

100 110 120 130 140 150 

m5 1 9 . pep I QQS EGEAQAAVNASNAE KI ARI NRAKGEAE S LRLVAEANAEAI RQ I AAALQTQGGADAV 

I MM I II III II IIIMIIIIMI Mill II MIMMIMI I 1 1 1 1 1 1 I 1 1 : 1 M I I 

g5 19 I QQS EGEAQAAVNASNAE KI ARI NRAKGEAES LRLVAEANAE ANRQI AAALQTQS GADAV 

210 220 230 240 250 260 

160 170 180 190 200 

m519 . pep NLKI AEQ YVAAFNNLAKE S NTL I MP ANVAD I G S L - ISAGMKIIDSSKTAK 

Mill i I : : I I ^ I I I I I : I I I Ihlhlh : h Mill 

g519 NLKIAGQYVTAFKNLAKEDNTRIKPAKVAEIGNPNFRRHEKFSPEAKTAK 
270 280 290 300 310 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 3049>: 

a51 9 . seq 

1 ATGGAATTTT T CAT TAT CT T GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

401 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

7 01 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TAT CG AC AG C AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3050; ORF 519.a>: 

a519 .pep 

1 MEFFIILLAA VWFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 

m519/a519 ORFs 519 and 519. a showed a 99.5% identity in 199 aa overlap 

10 20 30 

m519 .pep S VIGRMELDKT FEE RDE INST WAAL DEAA 

I I I M I I I I I I II ! II I I I I I I | : ] | | | | | 
a519 YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIGRMELDKTFEERDEINSTWSALDEAA 
90 100 110 120 130 140 

40 50 60 70 80 90 

m519 . pep GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
I I I I I I I M I I I M j I I I II I I I I I I I I M I II I M I I I I I I I I I I It I I I I I I I I I I M 
a519 GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
150 160 170 180 190 200 



100 110 120 130 140 150 
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m519 .pep 



a519 



IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
I I I I I II I I I I i I I I I I I I I I i M I I I M I I I i I I II I I I I I I I M I M I I I M I I I I I I 
IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
210 220 230 240 250 260 



m519 . pep 



a519 



160 170 180 190 200 

NLKI AEQYVAAFNNLAKESNTLIMPANVADIGSLI SAGMKI I DSSKTAKX 
I I M I I I I I M I I M I I I I I I I M I I I I I II I I I I I I I I I I I M I I I I I I 
NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX 
270 280 290 300 310 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3051 >: 

m519-l . seq 



1 ATGGAATTTT TCATTATCTT GTTGGTAGCC GTCGCCGTTT TCGGTTTCAA 

51 ATCCTTTGTT GTCATCCCAC AACAGGAAGT CCACGTTGTC GAAAGGCTGG 

101 GGCGTTTCCA TCGCGCCCTG ACGGcCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GCATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGTAC TGTTGTTGCG GCTTTGGACG AGGCGGCCGG GGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GAT T AAAG AC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC CGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

7 01 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

8 01 TCTGAAGATT GCGGAACAAT ACGTCGCTGC GTTCAACAAT CTTGCCAAAG 
851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 3052; ORF 519-1>: 

m519-l. 



1 MEFFIILLVA VAVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWA ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 I SAGMKI IDS SKTAK* 



The following DNA sequence was identified in N, gonorrhoeae <SEQ ID 3053>: 

g519-l.seq 



1 ATGGAATTTT TCATTATCTT GTTGGCAGCC GTCGCCGTTT TCGGCTTCAA 

51 ATCCTTTGTC GTCATCCCCC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGATAA TACGCAATTG ACTGTTGACG 

2 51 GCATCATCTA TTTCCAAGTA ACCGATCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCAAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGTAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG GGCTTGGGGT 

4 51 GTGAAAGTCC TCCGTTACGA AAT CAAGG AT TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCGCA ATGCAGGCAC AAATTACCGC CGAACGCGAA AAACGCGCCC 

551 GTATTGCCGA ATCCGAAGGC CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGTG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCCAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GCGAAGCGGA ATCCCTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGGGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTAGCCGC GTTCAACAAT CTTGCCAAAG 
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851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GCATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3054; ORF 519-l.ng>: 

g519-l .pep 

1 MEFFIILLAA VAVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKT FEE RDEINSTVVS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



m519-l/g519-l ORFs 519-1 and 519-1. ng showed a 99.0% identity in 315 
overlap 

10 20 30 40 50 60 

g5 19-1 .pep MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPF IDRVAYRHSL 
I I I I M I I : I I I I I I I I M I t I I I I I I ! I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
m519-l MEFFI ILLVAVAVFGFKS FWI PQQEVHWERLGRFHRALTAGLNILI PFI DRVAYRHSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

g519-l.pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
I I I I 1 I I M M II I I I I 1 I t I 11 1 M I ! II II I I I I I M I I I 1 I I I 11 I I I I I I I I I I I I 
m519-l KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

70 80 90 100 110 120 

130 140 150 160 170 180 

g5l9-l.pep RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 
I I II I I II I I I I I I I I i I I : I I II I I I I I 1 I I I I I M I I I I I ! I M I I I : I I I I I I I I I I 
m519-l RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

130 140 150 160 170 180 

190 200 210 220 230 240 

g519-l . pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I 11 I I ! I I I M I I I I M I I I I I I I I I M 
m519-l KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

190 200 210 220 230 240 

250 260 270 280 290 300 

g5 19-1. pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I I I II I I I I I I I I I II M I I I I I I I I I I I I I 11 M I I I I M I I I I I I I I 1 I I I I I I I I 1 I 
m519-l LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 

250 260 270 280 290 300 

310 

g519-l.pep ISAGMKIIDSSKTAKX 
I II I I I I 11 11 II I I I 
m5 1 9- 1 I SAGMKI I DSSKTAKX 

310 

The following DNA sequence was identified in N. meningitidis <SEQ ID 305 5>: 

a519-l.seq 

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 



WO 99/57280 



78 



PCT/US99/09346 



401 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

7 01 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3056; ORF 519-l.a>: 

a519-l .pep. 

1 MEFFIILLAA WVFG FKSFV . VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAE I QQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 

m519-l/a519-l ORFs 519-1 and 519-1. a showed a 99.0% identity in 315 

overlap 

10 20 30 40 50 60 

a519-l .pep ME FFI I LLAAVVVFGFKS FVV I PQQEVHWERLGRFHRALTAGLN I L I PFI DRVAYRHS L 
M I I I M I : I I : 1 I I I I M I I I I 1 I I I I I I I M I I M I I I I I I I I I I I I t I I I I I I I I I I 
m519-l ME FFI I LLVAVAVFGFKS FWI PQQEVHWERLGRFHRALTAGLNI LI PFI DRVAYRHSL 

10 20 30 40 50 60 . 

70 80 90 100 110 120 

a519-l - pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

I I I I I I I I I M M I I M i I I I II I I I I i I I ! I I 1 II I 1 I I I I II I I I I i II I t i I I I I I I 

m519-l KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

70 80 90 100 110 120 

130 140 150 160 170 180 

a51 9-1 . pep RMELDKTFEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
I I II M I I I 1 I M I I I I I I : II I I I M I I I I I II I I I I I M I t I I I I I I I I I 11 I i I I I I 
m519-l RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

130 140 150 160 170 180 

190 200 210 220 230 240 

a519-l .pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
I I I I II I I M I I I M I I I I I I I I I I M I I I I I II ! I I I I I I I i I I I I II I II I I I I I I I I 
IU519-1 KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

190 200 210 220 230 240 

250 260 270 280 290 300 

a519- 1 . pep LVAEANAEAI RQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKE SNTLIMP AN VADIGSL 
I I I I I I M I I I I I IE I I I I I I I I I I I I ! I I I I I 11 I I I 1 I I M I I I I I I I I I I I I I I I M 
m519-l LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 

250 260 270 280 290 300 

310 

a 5 1 9- 1 . pep I SAGMKI I DSSKTAKX 
I I I I M I I I I I I II I I 
m5 1 9 - 1 I SAGMKI I DS S KTAKX 

310 
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576 and 576-1 gnm22.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3057>: 

m576.seq.. (partial) 

1 . . ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

51 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

101 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

151 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

201 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

251 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

301 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

351 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

4 01 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

451 GTGATTCCGG GTTGGACCGA AGgCGTACAG CTTCTGAAAG AAGGCGGCGA 

501 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

551 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

601 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

651 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3058; ORF 576>: 

m576.pep. . (partial) 

1 . • MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

51 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

101 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

151 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

201 KIGAFENAPA KQPAQVDIKK VN* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3059>: 

g576.seq. . (partial) 

1 . .atgggcgtgg acatcggacg ctccctgaaa caaatgaagg aacagggcgc 

51 ggaaatcgat ttgaaagtct ttaccgatgc catgcaggca gtgtatgacg 

101 gcaaagaaat caaaatgacc gaagagcagg cccaggaagt gatgatgaaa 

151 ttcctgcagg agcagcaggc taaagccgta gaaaaacaca aggcggatgc 

201 gaaggccaac aaagaaaaag gcgaagcctt cctgaaggaa aatgccgccg 

251 aagacggcgt gaagaccact gcttccggtc tgcagtacaa aatcaccaaa 

301 cagggtgaag gcaaacagcc gacaaaagac gacatcgtta ccgtggaata 

351 cgaaggccgc ctgattgacg gtaccgtatt cgacagcagc aaagccaacg 

401 gcggcccggc caccttccct ttgagccaag tgattccggg ttggaccgaa 

451 ggcgtacggc ttctgaaaga aggcggcgaa gccacgttct acatcccgtc 

501 caaccttgcc taccgcgaac agggtgcggg cgaaaaaatc ggtccgaacg 

551 ccactttggt atttgacgtg aaactggtca aaatcggcgc acccgaaaac 

601 gcgcccgcca agcagccgga tcaagtcgac atcaaaaaag taaattaa 

This corresponds to the amino acid sequence <SEQ ID 3060; ORF 576.ng>: 

g576.pep. . (partial) 

1 ..MGVDIGRSLK QMKEQGAEID LKVFTDAMQA VYDGKEIKMT EEQAQEVMMK 

51 FLQEQQAKAV EKHKADAKAN KEKGEAFLKE NAAEDGVKTT ASGLQYKITK 

101 QGEGKQPTKD DIVTVEYEGR LIDGTVFDSS KANGGPATFP LSQVIPGWTE 

151 GVRLLKEGGE ATFYIPSNLA YREQGAGEKI GPNATLVFDV KLVKIGAPEN 

201 APAKQPDQVD IKKVN* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m576/g576 ORFs 576 and 576. ng showed a 97.2% identity in 215 aa overlap 

10 20 30 40 50 60 

m57 6 . pep MQQASYAMGV DI GRSLKQMKEQGAE I DLKVFTEAMQAVYDGKE IKMTEEQAQEVMMKFLQ 

M M I I I I ! I I I I I I | | 1 | M | | M : 1 I I I I I I II I I I I I I | I I | | | | | | 1 M 
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g57 6 MGVDIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVMMKFLQ 

10 20 30 40 50 

70 80 90 100 110 120 

mS7 6 . pep EQQAKAVEKHKADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIV 
I M I I I I II I I I I I I f I I M I I I i I I I I I I : I I I I M I i I I I I I I I I I I I II I I I I I I I I 
g57 6 EQQAKAVEKHKADAKANKEKGEAFLKENAAEDGVKTTASGLQYKITKQGEGKQPTKDDIV 
60 70 80 90 100 110 

130 140 150 160 -170 180 

m57 6 . pep TVEYEGRLIDGTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYRE 
I 1 II I I II I I I I M I It M I II I : II I M II M I i I I I S : M II I 1 I M I i II I I I I I I I 
g57 6 TVEYEGRLIDGTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEATFYIPSNLAYRE 
120 130 140 150 160 170 

190 200 210 220 

Itl57 6 . pep QG AG DK I GPN AT L V FDVKLVK I G APEN AP AKQP AQV D I KKVNX 

I I I I : I I M ! M I I I I 11 li M U I II M M I I II I I II i M 
g57 6 QGAGEKIGPNATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 
180 190 200 210 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 306 1>: 

a576. seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

7 01 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3062; ORF 576.a>: 

a576.pep 

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

m576/a576 ORFs 576 and 576. a showed a 99.5% identity in 222 aa overlap 

10 20 30 

m57 6 - pep MQQASYAMGVDIGRSLKQMKEQGAEIDLKV 

I I I M i I I M I I I M I I I I I I I M M I I i I 
a57 6 CGKKEAAPAS ASE PAAAS SAQGDTSS I GSTMQQAS YAMGVDIGRSLKQMKEQGAE I DLKV 

30 40 50 60 70 80 

40 50 60 70 80 90 

m57 6 . pep FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 
I M I I! U I 1 I I I M M I I I I M M II 1 I I M 1 I I I I I I II I I II I II II II II I I I I M 
a57 6 FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 

90 100 110 120 130 140 
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100 110 120 130 140 150 

m57 6.pep KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLIDGTVFDSSKANGGPVTFPLSQ 
I I I II I I M I I I M I I I M I I I I I I M I I I I II I I I I I I M I l! I I I ! 1 I I I I U I I U I 
a576 KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLIDGTVFDSSKANGGPVTFPLSQ 

150 160 170 180 190 200 



160 170 180 190 200 210 

m57 6 . pep VI PGWTEGVQLLKEGGEATFYI PSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

I I I I I I I M I I I I I I II I U I M ! I I I ! I I I I I I I I M I I M I I II ! I I I I M I I I I I I 
a576 VILGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

210 220 230 240 250 260 



220 

m5 7 6 . pep KQPAQVD I KKVNX 

IMM III Mill 
a576 KQPAQVD I KKVNX 

270 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3063>: 

m576-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

451 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

8 01 CATCAAAAAA GTAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 3064; ORF 576-l>: 

m576-l.pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDS SKAN GGPVTFPLSQ 

201 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN AT LVFDVKLV 

251 KIGAPENAPA KQPAQVD IKK VN* 

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3065>: 

g576-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTGCCGCG CAGGGCGACA CCTCTTCAAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC AATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 ACAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGATG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCCCAGGAAG TGATGATGAA ATTCCTGCAG GAGCAGCAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGATG CGAAGGCCAA CAAAGAAAAA GGCGAAGCCT 

4 01 TCCTGAAGGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGT 

4 51 CTGCAGTACA AAATCACCAA ACAGGGTGAA GGCAAACAGC CGACAAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACCGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG CCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACGG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
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701 GCGAAAAAAT CGGTCCGAAC GCCACTTTGG TATTTGACGT GAAACTGGTC 
7 51 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG ATCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3066; ORF 576-Lng>: 

g 57 6-1. pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASAA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTDAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPATFPLSQ 

201 VIPGWTEGVR LLKEGGEATF YIPSNLAYRE QGAGEKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPDQVDIKK VN* 



g576-l/m576-l ORFs 576-1 and 576-1. ng showed a 97.8% identity in 272 aa 

overlap 

10 20 30 40 50 60 

g576-l . pep MNTI FKI SALTLS AALALSACGKKEAAPASASEPAAASAAQGDTS S I GSTMQQAS YAMGV 
i I I I I ! I i I I I M 1 I 1 ! i i I 1 1 I I I 1 1 ; I t ) I I I I HI : ! I I ! II 1 : I I M 1 I I 1 I I ! I 1 
m5 7 6- 1 MNT I FKI SALT LS AALALS ACGKKE AAP AS ASE P AAAS SAQGDTS SIGSTMQQAS YAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

g57 6-1 . pep DIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVbdMKFLQEQQAKAVEKH 
I I I f I I i I I I I I I I I I I I I I I I : I I I I I M II I I I I I If I I I I I I I I I I I I i i I I I I I I I 
m576-l DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 

70 80 90 100 110 120 

130 140 150 160 170 180 

g576-l . pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
I I I M I I I I I i I II M M I I I M I M II I I I I II II i I I II I I I M II I II I 11 I! I I t I 
m57 6-l KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 

130 140 150 160 170 180 

190 200 210 220 230 240 

g57 6-1 . pep GTVFDSSKANGG PAT FPL SQV I PGWTEGVRLLKEGGEATFY I PSNLAYREQGAGEK I GPN 

I I M I M I I 11 I I : 1 M i 1 I I I I I II H I : I I I 1 I I M M I I M M M M M I I : M I I I 
m57 6-l GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 

190 200 210 220 230 240 

250 260 270 

g57 6-l .pep ATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 

II I I I I I i I I I 1 I I I I M M II I I I I I I I 1 I I 
m57 6- 1 ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 

250 260 270 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3067>: 

a57 6-1 . seq 

1 ATGAACACCA 

51 ACTTTCCGCC 

101 CTGCCGCCGC 

151 ATGCAGCAGG 

201 GCAAATGAAG 

251 CCATGCAGGC 

301 GCTCAGGAAG 

351 AGAAAAACAC 

401 TTCTGAAAGA 

451 CTGCAATACA 

501 CGACATCGTT 

551 TCGACAGCAG 

601 GTGATTCTGG 

651 AGCCACGTTC 



TTTTCAAAAT CAGCGCACTG 
TGCGGCAAAA AAGAAGCCGC 
TTCTTCCGCG CAGGGCGACA 
CAAGCTATGC GATGGGCGTG 
GAACAGGGCG CGGAAATCGA 
AGTGTATGAC GGCAAAGAAA 
TCATGATGAA ATTCCTTCAG 
AAGGCGGACG CGAAGGCCAA 
AAATGCCGCC AAAGACGGCG 
AAATCACCAA ACAGGGCGAA 
ACCGTGGAAT ACGAAGGCCG 
CAAAGCCAAC GGCGGCCCGG 
GTTGGACCGA AGGCGTACAG 
TACATCCCGT CCAACCTTGC 



ACCCTTTCCG CCGCTTTGGC 
CCCCGCATCT GCATCCGAAC 
CCTCTTCGAT CGGCAGCACG 
GACATCGGAC GCTCCCTGAA 
TTTGAAAGTC TTTACCGAAG 
TCAAAATGAC CGAAGAGCAG 
GAACAACAGG CTAAAGCCGT 
TAAAGAAAAA GGCGAAGCCT 
TGAAGACCAC TGCTTCCGGC 
GGCAAACAGC CGACCAAAGA 
CCTGATTGAC GGTACGGTAT 
TCACCTTCCC TTTGAGCCAA 
CTTCTGAAAG AAGGCGGCGA 
CTACCGCGAA CAGGGTGCGG 
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701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 
751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3068; ORF 576-l.a>: 

a576-l .pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

a576-l/m576-l ORFs 576-1 and 576-1. a showed a 99.6% identity in 272 

overlap 

10 20 30 40 50 60 

a576-l .pep MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV 
I I I I I I I I I II I I i I I I I I I I M I II i I I I I I I I I I I I I I I I M I I I I If I II I M I I I I 
m576-l MNT I FKI SALT LS AALAL S ACGKKE AAPAS AS E PAAAS SAQGDT S S IG STMQQAS YAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

a57 6-1 . pep DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 
I I i II I II I I I I I I I I I M M II I I M I I II I II I II I I I I I I II I I I I I II I I M I I if 
m57 6-l DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 

70 80 90 100 110 120 

130 140 150 160 170 180 

a57 6-1 . pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID . 

I I I I I 1 i I I ( M I I I 1 I II 1 I M M I I 1 i I I M I 1 M I i M I 11 M I 1 1 I i I f I M I I 1 t 
m57 6- 1 KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
130 140 150 160 170 180 

190 200 210 220 230 240 

a57 6-l.pep GTVFDSSICANGGPVTFPLSQVILGWTEGVQLLKEGGEATFYIPSNLAYRE QGAGDKIGPN 
I I I I I I II I I I I I I I I I I II I I I M I I I I I I II I I I II I I I I I M I M I I I I I I I I I I I 
m57 6-l GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 

190 200 210 220 230 240 

250 260 270 

a57 6-1 . pep ATLVFDVKLVKI GAPE NAPAKQ PAQVD I KKVNX 
I I I f I I I I II I I I I I I I I I I II I I I I I I I I II I 
m57 6-1 ATLVFDVKLVKIGAPENAPAKQ PAQVD I KKVNX 

250 260 270 



919 gnm43.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3069>: 

n\919 . seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TACGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTC TATACCGTTG TACCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCTTTCAGG CAAAACAGTT 

3 51 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAG CCTTG 

4 01 CCGGTACGGT TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGACAGG 
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451 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG A CGATTTT AT 

5 01 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

601 CATACCGCCG ACCTCTCCcG ATTCCCCATC ACCGCGCGCA CAACAGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

751 GAAGACCCTG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

851 AACATCCyTA CGTTTCCATC GGACGCTATA TGGCGGATAA GGGCTACCTC 

901 AAACTCGGAC AAACCTCCAT GCAGGGCATT AAGTCTTATA TGCGGCAAAA 

951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCCGCTGA TGGGGGAATA TGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGTGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTC CTACCCAACG 

13 01 GTATGAAGCC CGAATACCGc CCGTAA 



This corresponds to the amino acid sequence <SEQ ID 3070; ORF 919>: 

m9l9 .pep 

1 MKKYLFRAAL YGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SFQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

2 01 HTADLSRFPI TARTTAI KGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 
251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

3 01 KLGQTSMQGI KSYMRQNPQR LAEVLGQNPS YIFFRELAGS SNDGPVGALG 
351 TPLMGEYAGA VDRHYI TLGA P IiFV ATAH PV TRKALNRLIM AQDTGSAIKG 
401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 307 1>: 

g919 . seq 

1 ATGAAAAAAC ACCTGCTCCG CTCCGCCCTG TACGGcatCG CCGCCgccAT 

51 CctcgCCGCC TGCCAAAgca gGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGG CCGG CCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TTGCCGGCGG CGGGGCCGTC TAT AC CGTTG TGCCGCACCT 

201 GTCCATGCCC CACTGGGCGG CGCaggATTT TGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

3 01 TGCGCCCAAG CCTTTCAAAC CCCCGTGCAT TCCTTTCAGG CAAAGcGgTT 
351 TTTTGAACGC TATTTCACGC cgtGGCaggt tgcaggcaAC GGAAGc CTTG 

4 01 Caggtacggt TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGGCAGG 
451 CGGACGGAAC GGGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 
501 CTCCGTCCCG CTGCCTGCCG GTTTGCGGGG CGGAAAAAAC CTTGTCCGCA 

5 51 TCAGGCAGac ggGGAAAAAC AGCGGCACGA TCGACAATGC CGGCGGCACG 
601 CATACCGCCG ACCTCTCCCG ATTCCCCATC ACCGCGCGCA CAACGGcaat 
651 caaaGGCAGG TTTGAaggAA GCCGCTTCCT CCCTTACCAC ACGCGCAACC 
701 AAAtcaacGG CGGCgcgcTT GACGGCAAag cccCCATCCT CggttacgcC 
751 GAagaccCcG tcgaacttTT TTT CATGC AC AtccaaggCT CGGGCCGCCT 
801 GAAAACCCcg tccggcaaat acatCCGCAt cggaTacgcc g a c AAAAACG 
851 AACAtccgTa tgtttccatc ggACGctaTA TGGCGGACAA AGGCTACCTC 
901 AAGctcgggc agACCTCGAT GCAGGgcatc aaagcCTATA TGCGGCAAAA 
951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC GGCAATGAGG GCCCCGTCGG CGCACTGGGC 

1051 ACGCCACTGA TGGGGGAATA CGCCGGCGCA ATCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CAGGCAGCGC GATCAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGTTAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATACGT CTGGCAGCTC CTGCCCAACG 

1301 GCATGAAGCC CGAATACCGC CCGTGA 
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This corresponds to the amino acid 



g9l9.pep 
1 


MKKHLLRSAL 


YGIAAAILAA 


51 


GTTVAGGGAV 


YTWPHLSMP 


101 


CAQAFQTPVH 


SFQAKRFFER 


151 


RTERARFPIY 


GIPDDFISVP 


201 


HTADLSRFPI 


TARTTAI KGR 


251 


EDPVELFFMH 


IQGSGRLKTP 


301 


KLGQTSMQGI 


KAYMRQNPQR 


351 


TPLMGEYAGA 


IDRHYITLGA 


401 


AVRVDYFWGY 


GDEAGELAGK 



nee <SEQ ID 3072; ORF 919.ng>: 

CQSRSIQTFP QPDTSVINGP DRPAGIPDPA 
HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 
YFTPWQVAGN GSLAGTVTGY YEPVLKGDGR 
LPAGLRGGKN LVRIRQTGKN SGTIDNAGGT 
FEGSRFLPYH TRNQINGGAL DGKAPILGYA 
SGKYIRIGYA DKNEHPYVS I GRYMADKGYL 
LAEVLGQNPS YIFFRELAGS GNEGPVGALG 
PLFVATAHPV TRKALNRL I M AQDTGSAIKG 
QKTTGYVWQL LPNGMKPEYR P* 



ORF 919 shows 95.9 % identity over a 441 aa overlap with a predicted ORF (ORF 919.ng) 
from N. gonorrhoeae; 

m919/g919 

10 20 30 40 50 60 

m9l9 - pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

HhhhIIIMHMMIIIhlMlililllliMMMhMIMIH 1 1 0 1 1 1 1 

g 9 19 MKKHLLRSALYGI AAAILAACQSRS IQTFPQPDTSVINGPDRPAGI PDPAGTTVAGGGAV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m919 . pep YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWODVCAQAFQTPVHSFQAKQFFER 

MlllllhllMMIIMIMIMlMMIIlllllllMMIllllMlMlhllll 

g919 YTWPHLSMPHWAAQDFAKSLQS FRLGC ANLKNRQGWQDVCAQAFQT PVHS FQAKRFFER 

70 80 90 100 110 120 



130 140 150 160 170 180 

m919 . pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

IIIIMIMIIMIIIMIMMIIIII III :|lll!IIIIIMIIIIIIMIh!l 

g919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPL.PAGLRGGKN 
130 140 150 160 170 180 



190 200 210 220 230 240 

m919.pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

Ml IM III!! MilhIM IMil IMMIi'l I II ! i I IN I Mill MM I i Ml 1 1! 

g919 LVRI RQTGKNSGTIDNAGGTHTADLSRFPI TARTTAI KGRFEGSRFLPYHTRNQINGGAlt 

190 200 210 220 230 240 



250 260 270 280 290 300 

m9 1 9 . pep DGKAP I LGYAEDPVELFFMHI QGSGRLKTPSGKY I RI GYADKNEHPYVS I GRYMADKGYL 

M I I I I ! I II I I I II I ! I I I I ! I I I I i I ! I I i 1 1 I ! I I 1 1 I 1 1 I II 1 1 I ! I ! I I I M I 

g919 DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGRYMADKGYL 

250 260 270 280 290 300 



310 320 330 340 350 360 

m919 . pep KLGQTSMQGI KSYMRQNPQRLAEVLGQNPSYI FFRE LAGS SNDGPVGALGTPLMGEYAGA 

I II MIMMM M IMM M Mil I I I I MMM MM i M MUM II IM I MM M 

g919 KLGQTSMQGI KAYMRQNPQRLAEVLGQNPSYI FFRELAGSGNEGPVGALGTPLMGEYAGA 

310 320 330 340 350 360 

370 380 390 400 410 420 

m919.pep VDRHYI TLrGAPLFVATAHPVTRKALNRLIMAQDTGSAI KGAVRVDYFWGYGDEAGELAGK 

= M 1 1 1 1 1 M 1 1 I ! 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 i I M 1 1 M 1 1 1 1 1 

g919 IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAI KGAVRVDYFWGYGDEAGELAGK 

370 380 390 400 410 420 
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430 440 
m9 1 9 . pep QKTTGYVWQLLPNGMKPEYRPX 

I : I M I I I I M i I I Millll I 

g 9 1 9 QKTTGYVWQLLPNGMKPEYRPX 

430 440 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3073>: 

a919 . seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TGCGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG .CGGGGCCGTT TATACCGTTG TGCCGCACCT 

2 01 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCGTTCAGG CAAAACAGTT 

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

401 CCGGTACGGT TACCGGCTAT TACGAGCCGG TGCTGAAGGG CGACGACAGG 

4 51 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

601 CATACCGCCG ACCTCTCCCA ATTCCCCATC ACTGCGCGCA CAACGGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

7 51 GAAGACCCCG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

851 AACATCCCTA CGTTTCCATC GGACGCTATA TGGCGGACAA AGGCTACCTC 

901 AAGCTCGGGC AGACCTCGAT GCAGGGCATC AAAGCCTATA TGCAGCAAAA 

951 CCCGCAACGC CTCGCCGAAG TTTTGGGGCA AAACCCCAGC TATATCTTTT 

1001 TCCGAGAGCT TACCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCCGCTGA TGGGCGAGTA CGCCGGCGCA GTCGACCGGC AC T ACAT T AC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTT CTGCCCAACG 

1301 GTATGAAGCC CGAATACCGC CCGTAA 

This corresponds to the amino acid sequence <SEQ ID 3074; ORF 919.a>: 

a919.pep 

1 MKKYLFRAAL CGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SVQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSQFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGQTSMQGI KAYMQQNPQR LAEVLGQNPS YIFFRELTGS SNDGPVGALG 

351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

4 01 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

m919/a919 ORFs 919 and 919.a showed a 98.6% identity in 441 aa overlap 

10 20 30 40 50 60 

m919.pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
I ! I I I I I I I I M I ! I I I I I I I ! I II I ! M I 1 II ! I I ! I M I I I I I I I I I I I I! ! M II I 
a919 MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m919 . pep YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
I I M I I M 1 I I I I I I I I I i I I I I I I I I II I I I I I I I M I I I I I I II I f I I I I 1 I I I I I I 
a 91 9 YTVVPHLSLPHWAAQDFAKS LQSFRLGCAN LKNRQGWQDV CAQAFQTPVHSVQAKQFFER 

70 80 90 100 110 120 
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130 140 150 160 170 180 

m919.pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
I I M M I I I I I I I I I i I I II I I M I I I I I f I I I I I I I I II M I 1 M I I I I II I I I I f | M 
a919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGIPDDFISVPLPAGLRSGKA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m919 . pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
M 11 I I I M I I I I I I I I I I I I I I I I I : | I I | U I I I ! I I I I I II I I | M I I I I M I I I I j 
a919 LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m919 . pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

I I 1 I I I M I I I I I I I I t M I I I I M I I I I I I I I I I I I I I I | | | I | | | | | | I | | M 

a919 DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m919 . pep KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
I I M M I I I I I : I I : I I I I I I I I I I I I M I I I I I I I I : I I I I I I I I | | | I I I II I I M I I 
a919 KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

310 320 330 340 350 360 

370 380 390 400 410 420 

m919 . pep VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
M I I I M I I I I I I I I I I II I I I I I I I I I M I I I II I I M M I I I I I I I I I I I I I I I I I I I 
a919 VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 

370 380 390 400 410 420 

430 440 
m9 1 9 . pep QKTTGYVWQLLPNGMKPEYRPX 
I II M I I I I I I I I I I I I I I I | | 
a 9 1 9 QKTTGYVWQLLPNGMKPEYRPX 
430 440 



121 and 121-1 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3075>: 

ml21.seq 

1 AT GGAAAC AC AGCTTTACAT CGGCATCATG TCGGGAACCA GGATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

4 01 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

451 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

501 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

551 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

601 xxxxxxCAGC TTCCTTACGA CAAAAACGGT GCAAAGTCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACGCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCAT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGACG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGnATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 



WO 99/57280 



88 



PCT/US99/09346 



1051 GCAACCGGCG CATCCAAACC GTGTATTCTG AnCGCGGGAT ATTATTATTG 
1101 A 

This corresponds to the amino acid sequence <SEQ ID 3076; ORF 12 1>: 

ml 21 .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3077>: 

gl21 . seq 

1 ATGGAAACAC AG C T T T AC AT CGGCATTATG TCGGGAACCA GTATGGACGG 

51 GGCGGATGCC GTGCTGGTAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCTGACC GGTTGCGCCG CAAATTGCTG 

151 GATTTGCAGG AC AC AGG C AC AGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCTCCGTGC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTtac AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACTGa cgcggatttT TACCGTCggc gacttcCGCA 

4 01 GCCGCGACCT TGCTGCCGGC GGacaAGGTG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA TGACAGGGAA ACACGCGTGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGGCGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGAcgcgtg gacgcaggca 

601 cacTGGcagc TGCCTTACGA CAAAAacggt gcAAAGgcgg cacAAGGCAA 

651 catatTGCcg cAACTGCTCG gcaggctGCT CGCCcaccCG TATTTCTCAC 

701 AACCCcaccc aaAAAGCACG GGgcGCGaac TgtttgcccT AAattggctc 

751 gaaacctAcc ttgacggcgg cgaaaaccga tacgacgtat tgcggacgct 

801 ttcccgattc accgcgcaaA ccgTttggga cgccgtctca CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCTCAATG GGTGGAGGCG gccgCATTtg 

1001 cgtggttggC GGCGTGTTGG ATTAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCGACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3078; ORF 121.ng>: 

gl21.pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRWLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF121.ng) 
from TV. gonorrhoeae: 

m!21/gl21 

10 20 30 40 50 60 

ml2l.pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I M I I I I I I I I I I I I I I I I : I I I I II I I I I I I II I I I I I I I I I : I I I I I I I I : I I I 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21.pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I M I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M I If 
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gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 

gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: : I I M I I I I M : I I I I I I II I I I I I I I I I I : I Mill 

gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml 21 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIR1S1PV 
M I I I I : I t I I It I M I I 1 I 11 I 11 I i I !) I I I i I I I I I I I I I II I 1 I M I 1111111 

gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 

310 320 330 340 350 360 

ml21.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 

II M t I I II I II I I I I I I I : I I M I I I I ! I I I ! I I II M M I I [ 1 I M I M I I I I M II 
gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
MINI 

gl21 GAGYYYX 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 3079>: 



a!21 . seq 



1 


ATGGAAACAC 


AGCTTTACAT 


51 


GGCGGATGCC 


GTACTGATAC 


101 


AAGGGCACGC 


CTTTACCCCC 


151 


GATTTGCAGG 


ACACAGGCGC 


201 


GCAAGAACTC 


AGCCGCCTGT 


251 


GTCAAAACCT 


CGCGCCGTCC 


301 


ACCGTCAGAC 


ACGCGCCGGA 


351 


GCCGCTGCTG 


GCGGAACGGA 


401 


GCCGCGACCT 


TGCGGCCGGC 


451 


CACGAAGCCC 


TGTTCCGCGA 


501 


CGGCGGGATT 


GCCAACATCA 


551 


GCTTCGACAC 


AGGACCGGGC 


601 


CACTGGCAGC 


TTCCTTACGA 


651 


CATATTGCCG 


CAACTGCTCG 


701 


AACCCCACCC 


TAAAAGCACG 


751 


GAAACCTACC 


TTGACGGCGG 


801 


TTCCCGATTC 


ACCGCGCAAA 


851 


CAGATGCCCG 


TCAAATGTAC 


901 


TTAATGGCGG 


ATTTGGCAGA 


951 


CACCGCCGAA 


CTGAACCTCG 


1001 


CATGGATGGC 


GGCGTGTTGG 


1051 


GCAACCGGCG 


CATCCAAACC 


1101 


A 





CGGCATCATG TCGGGAACCA GCATGGACGG 
GGATGGACGG CGGCAAATGG CTGGGCGCGG 
TACCCCGGCA GGTTACGCCG CAAATTGCTG 
GGACGAACTG CACCGCAGCA GGATGTTGTC 
ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 
GACATTACCG CCCTCGGCTG CCACGGGCAA 
ACACAGTTAC AGCGTACAGC TTGCCGATTT 
CTCAGATTTT TACCGTCGGC GACTTCCGCA 
GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 
CGACAGGGAA ACACGCGCGG TACTGAACAT 
GCGTACTCCC CCCCGACGCA CCCGCCTTCG 
AATATGCTGA TGGACGCGTG GATGCAGGCA 
CAAAAACGGT GCAAAGGCGG CACAAGGCAA 
ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 
CGAAAACCGA TACGACGTAT TGCGGACGCT 
CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 
ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 
GTCAACCGCA TTCCCGGTAG TCCGCACAAA 
GTGTATTCTG GGCGCGGGAT ATT AT TAT TG 



This corresponds to the amino acid sequence <SEQ ID 3080; ORF 121.a>: 

al21.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 
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301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 
351 ATGASKPCIL GAGYYY* 

ml21/al21 ORFs 121 and 121. a showed a 74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

ml 21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I II I M ! I I I I I I I I I! M I I I I M I I I I I I M I! H I I I I M ! I M : I I I M I I II I I ! 
al21 METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I I I M i I I I I I i I I I I I II I i I I I I I I II M I I I I II I I I I It : I I : I I II I I I I 
al21 HRSRMLSQELSRLYAQTAAELLCSQNIAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : 
a 121 AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: II I I I I I II I : I I I I I I I I I I I I I I I I I I I I I I I I II 

al21 PAFGFDTGPGNMLMDAVJMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 21 .pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I i t I I I : M 1 I M ! I M I I I ! II I M I I I I I M II I I I I I I I I M I I I I 1 I I II I I I I 
al21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml21 - pep LMADIJVECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I II II II I I I I I I II I I : I I I I I I I 1 I I I I I I : I I II : II I I I I I I I I I I I II I I I I 
al21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

a!21 GAGYYYX 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 308 1>: 

ml 2 1-1 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC G TACT GAT AC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AG CAT AC AG C TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

7 01 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 
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851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG AMCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3082; ORP 121-1>: 

ml21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

ml21-l/gl21 ORFs 121-1 and 121. ng showed a 95.6% identity in 366 aa overlap 

10 20 30 40 50 60 

ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I [ I I II I II I I I M I I I M I I I : I I I I I I I I f I I I I M I I I I I i I I : I I I I I I II : I I I 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21-l.pep HRSRI LSQELSRLYAQTAAELLCSQNLAPS DITALGCHGQTVRHAPEHGYS IQLADLPLL 
I I II : I I M I I I I I M I I I I M I II I I I I I II I I I I I I M I II I I I I I I I I I I M I I I I 
gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYS IQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l .pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
I I I I I I I I I I I I I I II I I M II I i I I I I I I II I I I I : I I I I : I I I I I I I I I M I I I I i 
gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l.pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
I I I I I II I I I I f I I M I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I : I I I I I I I 
gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l.pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
I M II I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I II I I I I! I I I I I I I I I I I I I I I 
gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I II I I I I M I I I! II I I : I I M I M I I I I I M I I II I I I I I I I I I M I M I f I M I I 
gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
MINI 

gl21 GAGYYYX 



The following DNA sequence was identified in N. meningitidis <SEQ ID 3083>: 

al21-l.seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 
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51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA AC AC AG TT AC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT AT TAT TAT TG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 3084; ORF 121-l.a>: 

a!21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21-l/al21-l ORFs 121-1 and 121-1 -a showed a 96.4% identity in 366 aa overlap 



10 20 30 40 50 60 

ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I M I I M ! M I I I M I [ I I I II I I I I I ! I I I I ! I I I I I I I M I I I I I :! I I I [ I I I I I M 
al21-l METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFT PYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 



70 80 90 100 110 120 

10121-1. pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
II I I : I It I I I I I I f I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I : I I I I I M I 
al21-l HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml21-l . pep AERTRI FT VGD FRS RDLAAGGQGAPLVPAFHEALFRDNRETRAVLN I GG IAN I S VL PPDA 
I I I I = I I ! I I I I I I I i I I I I I I I I I I I I I I I I I I I I I : I I I I I I i I I I I I I I I I I I I I I I 
al21-l AERTQI FT VGD FRS RDLAAGGQGAPLV PA FHEALFRDDRETRAVLN I GG I ANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l . pep P AFG FDTG PGNMLMDAWTQAHWQL P YDKNGAKAAQGN I L PQLLDRLLAH P Y FAQPH PKST 
I i I It I M I M I I M I t M I I I ! I I 1 I 11 I I I M I I I I I M 1 I I II II I 1 I 1 11 I I I I I 
al21-l PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml21-l . pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMY ICGGGIRNPV 
M I 1 I I I I 11 M I I M I I I I I I 1 1 M I 1 1 I I I I 1 I I I I I I I I I I I I I I I ! I I I I M I I I 
al21-l GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVS HAAADARQMY ICGGGIRNPV 

250 260 270 280 290 300 
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310 320 330 340 350 360 

ml21-l.pep L^DLAECFGTRVSLHSTADLNLDPQWVEAMFAWLAACWINRIPGSPHKATGASKPCIL 
I I I 1 I i I 11 : I 1 I 1 1 1 I I I : I I I I M I I 1 I I I I I : I I I I : I I I I I I 1 I I I I ! I t I I I M 
a i21 L^DLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
M M II 

al21 GAGYYYX 



128 and 128-1 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 3085>: 

ml28.seq (partial) 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTGCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCAC 

1 T ACGCC AG CG AAAAACTGCG CGAAGCCAAA TACGCGTTCA GCGAAACCGA 

51 wGTCAAAAAA TAyTTCCCyG TCGGCAAwGT ATTAAACGGA CTGTTCGCCC 

101 AAmTCAAAAA ACTmTACGGC ATCGGATTTA CCGAAAAAAC yGTCCCCGTC 

151 TGGCACAAAG ACGTGCGCTA TTkTGAATTG CAACAAAACG GCGAAmCCAT 

201 AGGCGGCGTT TATATGGATT TGTACGCACG CGAAGGCAAA CGCGGCGGCG 

251 CGTGGATGAA CGACTACAAA GGCCGCCGCC GTTTTT CAGA CGGCACGCTG 

301 CAAyTGCCCA CCGCCTACCT CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG 

3 51 CAGGGAAGCC CGCyTGAGCC ACGACGAAAT CCTCATCCTC TTCCACGAAA 

4 01 CCGGACACGG GCTGCACCAC CTGCTTACCC AAGTGGACGA ACTGGGCGTA 
4 51 TCCGGCATCA ACGGCGTAkA ATGGGACGCG GTCGAACTGC CCAGCCAGTT 
501 TATGGAAAAT TTCGTTTGGG AATACAATGT CTTGGCACAA mTGTCAGCCC 
551 ACGAAGAAAC CGGcgTTCCC yTGCCGAAAG AACTCTTsGA CAAAwTGCTC 
601 GCCGCCAAAA ACTTCCAAsG CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 
651 CGCCCTCTTT GATATGATGA TTTACAGCGA AGACGACGAA GGCCGTCTGA 
701 AAAACTGGCA ACAGGTTTTA GACAGCGTGC GCAAAAAAGT CGCCGTCATC 
751 CAGCCGCCCG AATACAACCG CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 
801 AGGCGGCTAT TCCGCAGCTn ATTACAGCTA CGCGTGGGCG GAAGTATTGA 
851 GCGCGGACGC ATACGCCGCC TTTGAAGAAA GCGACGATGT CGCCGCCACA 
901 GGCAAACGCT TTTGGCAGGA AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 
951 nGCAGAATCC TTCAAAGCCT TCCGCGGCCG CGAACCGAGC ATAGACG C AC 

1001 TCTTGCGCCA CAG CGGTTTC GACAACGCGG TCTGA 

This corresponds to the amino acid sequence <SEQ ID 3086; ORF 128>: 

ml28.pep (partial) 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA I KAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTF ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKIi NH 



// 



1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRRFSDGTL, 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI 

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 

301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3087>: 

gl28 . seq 

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

3 01 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AAC CAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACG CTCGAAA ACGCATTGAA 

801 AACCGccaaa CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

12 01 ATCGGCGGCG TTTATATGGA TTTGTACG C A CGCGAAGGCA AACGCGGCGG 
1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

13 01 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

13 51 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AacCGGCCAC GG AC TGCACC ACCTGCTTAC C CAAGTGG AC GAACTGGGCG 
14 51 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG 
1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 
1551 C CACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 
1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 
1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATG CCGTCT 
1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 
1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatctTC 
1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC T ACG CATGGG CCGAAGTCCt 
1851 CAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 
1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC 
1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 3088; ORF 128.ng>: 

gl28 .pep 

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPIAGI P ED AIiAM FAA . 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE I LTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGEPLPKELF DKMLAAKNFQ RGMFUVRQME 

551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN R FANS FGH I F 

601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 



WO 99/57280 



95 



PCT/US99/09346 



651 AAESFKAFRG REPSIDALLR QSGFDNAA* 



ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

ml28/g!28 



10 20 30 40 50 60 

gl2 8 . pep M I DNALLHLGE E PRFNQ I QTED I KPAVQTAI AEARGQ I AAVKAQTHTG WANTVERLTG I T 

I IIIIMIIMIIhlhillMlhllMMII i ilhi! lii Mill ill lllll 

ml 2 8 MTDNALLHLGEEPRFDQ I KTED I KP ALQTAI AEAREQIAAI KAQTHTGW ANTVEPLTG I T 

10 20 30 40 50 60 



70 80 90 100 110 120 

g 1 2 8 . pep ERVGRI WGWSHLNSWDT PELRAV YNELMPE ITVFFTEI GQDI ELYNRFKT I KNS PEFA 

1 1 1! I M 11 1 II M hIMIMIillllMlllliHIIIIIMNIMMIIIIUI 

ml 2 8 ERVGR I WG WSHLNCVADTPE LRAVYNE LMPE I TVFFTE I GQD I EL YNRFKTI KNS PEFD 

70 80 90 100 110 120 



130 140 150 160 170 180 

gl28 . pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

I I I I I I I I I h I 
ml 2 8 TLS PAQKTKLNH 

130 

// 

340 350 360 

gl28 . pep YAGEKLREAKYAFSETEVKKYFPVGKVLAG 

IhlMMMIIIMl MIIMM 1 1 I 

ml 2 8 Y AS EKLRE AKYAFS ETX VKKYF P VGXVLNG 

10 20 30 



370 380 390 400 410 420 

gl28 .pep LFAQIKKLYGIGFAEKTVPWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWMNDYK 

1 1 II IIMIMhlllllllllllll lllllh:|IIIIIIIIIIIMIIIIMIIII 

m!28 LFAQXKKLYG I GFTEKTVPVWHKDVRYXELQQNGEX I GGV YMDLYAREGKRGGAWMNDYK 

40 50 60 70 80 90 



430 440 450 460 470 480 

gl28 . pep GRRRFADGTLQLPTAYLVCNFAP PVGGKEARLSHDE I LTL FHETGHGLHHLLTQVDELGV 

Ml ImM MM! Ml II I IMM! IMIIIMI MM IIIIIIMMIIIIIIIIMI 

ml28 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVDELGV 

100 110 120 130 140 150 



490 500 510 520 530 540 

gl28 . pep SG I NGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGEPLPKELFDKMLAAKNFQRGMF 

Mllll MIIIIMIIIIMMMIIMI lllllll 1 1 1 1 1 1 II IIIIMI III 

ml 2 8 SGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQXGMF 

160 170 180 190 200 210 



550 560 570 580 590 600 

gl28 .pep LWQMEFALFDMMIYSESDECRLKNWQQVLDSVRKEVAVIQPPEYNRFANSFGHIFAGGY 

III IMIIIIMIIhll U 1 I M! , 1: 1 1 I: I 1 IMM 1 1 1! 1 1 IMIIIIIM 

ml28 XVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIFAGGY 

220 230 240 250 260 270 



610 620 630 640 650 660 

g 1 2 8 . pep SAGYYS YAWAEVLS TDAY AAFEE S DDVAATG KR FWQ E I LAVGGSRSAAES FKAFRGREPS 

Ih I I M I! IN M III II I IMHIIIil II III MM I 1 1 1 = 1 1 1 1 1 I 1 1 1 1 1 1 I 

m!28 S AAXY S YAWAEVLS AD AY AAFEESDDVAATGKRFVJQE I LAVGXSRSGAES FKAFRGREPS 

280 290 300 310 320 330 
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670 679 
g 1 2 8 . pep I DALLRQSGFDNAAX 

I M I I I : I I li I I : 
ml 2 8 I DALLRHSGFDNAVX 

340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3089>: 

al28 . seq 

1 AT G ACT G AC A ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACG C AC AC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA AC AG C AG G C A 

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATT CG AC A ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GT ATT AAACG GACTGTTCGC 

1101 C C AAAT C AAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG C C AAAT G GAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGAC7AACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 3090; ORF 128.a>: 

al28.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28/al28 ORFs 128 and 128.a showed a 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml28 . pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
M II ! I ! I I I I I I I I I U M ! 1 ! M I II M I I I M I I 1 I I I I I I I ! ! !! ! i | | | | | | | M 
al28 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 2 8 . pep ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
Mllllilllllli I : I i 1 II I I : M M I M M I I I I 11 i I M I I I M I I I I I I 11 1 I I 
al28 ERVGRIWGWSHLNSVTDT.PELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 

ml28 . pep TLS PAQKTKXNH 

III II I i I I II 

al28 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 



ml 28. pep 

al28 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 

190 200 210 220 230 240 



ml28.pep 

al28 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 

140 150 

ml 2 8 . pep YASEKLREAKYAFSETXVKKYFPVGX 

11:1111111111111 I I I M I 1 I 
al2 8 ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

160 170 180 190 200 210 

ml2 8 . pep VLNGLFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 
II II I I II I I I I I I I I I I M I I I I I I I I I I I II I I H : I I I I I I i I I II I I I I I I 1 I I 
a!28 VLNGL FAQ IKKL YG IG FTEKTVPVWHKDVRYFE LQQNGET I GGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

220 230 240 250 260 270 

ml28 . pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
I M I 1 M t I II 11 I M I I M I M I ! : i 1 I M : 1 1 I I I II I M I I 11 i I M M I I II I M 
al28 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

280 290 300 310 320 330 

ml28.pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
I I I II 1 1 I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I II I I I I 
al28 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 

340 350 360 370 380 390 

ml28 . pep XGMFXVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 
Ml Ml I M I I I I I I I I I 11 I M II I I I I 11 I I I I I : I I I ::[ I I I I I I I I I I I I I 
al28 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 

550 560 570 580 590 600 
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400 410 420 430 440 450 

ml28 .pep AGG Y S AAX YS YAWAE V LS ADAYAAFEE S DDVAATGKRFWQE I LAVGXS RS GAES FKAFRG 

MINI: I II I I I I M I M 1 I I U I I I I I M I I I I I 1 I 1 I I I M I I M : i I I I I I M I 
al28 AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
610 620 630 640 650 660 

460 470 
ml 2 8 . pep RE PS I DALLRHSGFDNAVX 

I I I I I I I I I I M M M I : 
al28 RE PS I DALLRHSGFDNAAX 

670 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 309 1>: 

ml28-l . seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC 

651 ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCT ACGTT ACCCGCGCCA GCGAACTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGCAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTGA ACCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 G CT AC G C C AG CGAAAAACTG CGCGAAGCCA AATACGCGTT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAAT C AAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 

1351 GGCAGGGAAG CCCGCCTGAG CCACGACGAA ATCCTCATCC TCTTCCACGA 

14 01 AACCGGACAC GGGCTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGCATGT TCCTCGTCCG GCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 

1751 TCCAGCCGCC CGAATACAAC CGCTTCGCCT TGAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 3092; ORF 128-1>: 

ml28-l .pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVADTP ELRAVYNELM PEITVFFTEI 
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101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLANALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 1GGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 

4 51 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPS1DALLR HSGFDNAV+ 



The following DNA sequence was identified in N, gonorrhoeae <SEQ ID 3093>: 

gl28-l . seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

4 01 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCT CG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAAT CAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT T G C AACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 - TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TGTCCGGCAT CAACGGCGTA AAA 



This corresponds to the amino acid sequence <SEQ ID 3094; ORF 128-1 .ng>: 

gl28-l .pep (partial) 

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 



ml28-l/gl28-l ORFs 128-1 and 128-1. ng showed a 94.5% identity in 4 91 aa 
overlap 



10 20 30 40 50 60 
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gl28-l . pep MIDNALLHLGEEPRFNQIKTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 
i I I M I I I II I 1 t I : I i H I i II I I : I I I I I I I I I I I I : I I I I I I I ) I II I! Mill 
ml28-l MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 



70 80 90 100 110 120 

gl28-l . pep ERVGRIWGWSHLNSVVDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

I M I I M I I I I I I I I I : I I I I I I I ! I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
ml28-l ERVGRIWGVVSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 



130 140 150 160 170 180 

gl28-l . pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
I II II U I M : II II II II II ! I 1 I II : I II I M M II t I M I I I I M I II I M I I I 1 M 
ml28-l TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 



190 200 210 220 230 240 

gl28-l . pep FDDAAPLAGI PEDALAMFAAAAQSEGKTGYKIGLQI PHYLAVIQYAGNRELREQIYRAYV 
I I I II I I I I I I I I I f II I I I I I I I I : I I I I I I I I I I I I I I I I I I II I I I M I I I I If M 
ml 2 8 - 1 FDDAAPLAGI PEDALAMFAAAAQSESKTGYKIGLQI PHYLAVIQYADNRELREQI YRAYV 

190 200 210 220 230 240 



250 260 270 280 290 300 

gl28-l.pep TRASELSNDGKFDNTANIDRTLENALKTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
I I M I I I : I I I I I I I I I I I I I I I I I : I 11 M I I II I I I I I M I I II M I I I M I I I I I I 
ml28-l TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 



310 320 330 340 350 360 

gl28-l - pep ARRAKPYAEKDLAEVKAFAREHLGLADPQPWDLSYAGEKLREAKYAFSETEVKKYFPVGK 
I I I M 11 I M i M I I I I it 1 I I : M I I M I I : I I : I I I I I I I I I II I M I II i I I II I 
ml28-l ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 



370 380 390 400 410 420 

gl28-l . pep VLAGLFAQIKKLYGIGFAEKTVPVWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWM 
It II I M i I I M I I I I : I I ! I I I I M I I I I II I I I I I : I II I I I I I ! I I M I II I I I I I 
ml 2 8 - 1 VLNGLFAQIKKLYGI GFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 



430 440 450 460 470 480 

gl28-l . pep WDYKGRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
I I M I ! I M : M II 1 I I I I I I I I I I II M I I : I I II II M I I I I I 11 I M 1 M I 1 1 I I I 
ml28-l NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

430 440 450 460 470 480 



490 

g 1 2 8 - 1 . pep ELGVSGINGVK 
I I M I I I I 1 I : 

ml28-l ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 



The following DNA sequence was identified in TV. meningitidis <SEQ ID 3095>: 

al28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
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501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

7 51 AAATT CGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG C C AT G AC G AA ATCCTCACCC TCTTCCACGA 

1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGA'G 

1651 TTCGCCCTCT ttgatatgat gatttacagc gaagacgacg aaggccgtct 

1701 gaaaaactgg caacaggttt tagacagcgt gcgcaaagaa gtcgccgtcg 

1751 tccgaccgcc cgaatacaac cgcttcgcca acagcttcgg ccacatcttc 

1801 gcaggcggct attccgcagg ctattacagc tacgcgtggg cggaagtatt 

1851 gagcgcggac gcatacgccg cctttgaaga aagcgacgat gtcgccgcca 

1901 caggcaaacg cttttggcag gaaatcctcg ccgtcggcgg atcgcgcagc 

1951 gcggcagaat ccttcaaagc cttccgcgga cgcgaaccga gcatagacgc 

2 001 actcttgcgc cacagcggct tcgacaacgc ggcttga 

This corresponds to the amino acid sequence <SEQ ID 3096; ORF 128-l.a>: 

al28-l.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28-l/al28-l ORFs 128-1 and 128-1. a showed a 97.8% identity in 677 aa overlap 

10 20 30 40 50 60 

al2 8-l .pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I It I I I ! I I I I I I I ! I I I I I I I I I I I 
rtvl28-l MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

al2 8-l .pep ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
I I I I I I I I I I ! I I I I I : I I I I I I i : M I I I I I I I I I I I I II I I I I J i I f I I I I I I I I II i 
ml28-l ERVGRIWGWSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 140 150 160 170 180 
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al28-l . pep TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
Ml I I i M I I I I I I I I i I I II I i I I I I I 1 ! I 11 I I I I 1 1 i I I I I 1 I M I I I I I I I I I I I 
ml28-l TLSPAQKTKLNHDLRDFVLSGAELPPEOQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 



190 200 210 220 230 240 

al28-l.pep FDDAAPLAGI PEDALAMFAAAAQ5EGKTGYKIGLQ I PHYLAVIQYADNRKLREQI YRAYV 
I I I I I t I I I ! I I I I I I It I I 11 I I I : I I I I I I i I I I I I I I I M I I I I I I : I I I I I I I I M 
ml28-l FDDAAPLAGI PEDALAMFAAAAQSESKTGYKIGLQIPHYLAVIQYADNRELREQI YRAYV 

190 200 210 220 230 240 



250 260 270 280 290 300 

a 128-1 .pep TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
1 1 I I M I t M ! I I I ( I I 1 1 M 1 M I II I I It I I I i M I I I M I I M 1 1 1 M II I ( M M 
ml28-l TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 



310 320 330 340 350 360 

al28 - 1 . pep ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
i i I I I I I I I I I t M I I I I I I I M : I I I I i I 1 I I I I I : 1 I 1 I I I M I I I I I I I I I I i ! I I I 
ml 2 8 - 1 ARRAKP YAEKDLAEVKAFARESLN LADLQPWDLGYASEKLREAKYAFSETEVKKY FPVGK 

310 320 330 340 350 360 



370 380 390 400 410 420 

al28-l . pep VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
I I I I II I I I I II II I II II I M II I I I I I I I I I I I M II I II II II I I I I I M i I I II I I 
ml28-l VLNGLFAQI KKL YG I G FTEKT VPVWHKDVR YFE LQQNGET I GGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 



430 440 450 460 470 480 

al28-l.pep NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
I I I II I I I I I I 11 I I I I I I I I I I I I : I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I 1 I 
ml28-l NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
430 440 450 460 470 480 



490 500 510 520 530 540 

al28-l.pep ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
II i I 1 M II M I I I II I M i 1 I M I M I I I I I M I 11 1 I 1 I I I M I M 1 ! M I I I I I I M 
m!28-l ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 



550 560 570 580 590 600 

a!28-l.pep RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 
I II I I I I I I I I I I II I I I I I I I I I! I I I M I I I I I II M : I I I : : M I I I I I I MINI 
m!28-l RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 

550 560 570 580 590 600 



610 620 630 640 650 660 

al28-l . pep AGGYSAGYYS YAW AEVLS AD AYAA FEES DDVAATGKRFWQE I LAVGGSRS AAES FKAFRG 
I I I I I I I I I I M II I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I II I I II I 
ml28-l AGGYSAGYYS YAWAEVLSADAYAAFEESDDVAATGKRFWQEI LAVGGSRSAAES FKAFRG 

610 620 630 640 650 660 



670 679 
al28-l .pep REPSIDALLRHSGFDNAAX 

M 1 M I I I I M i N II I : 
ml 28-1 REPSI DALLRHSG FDNAVX 

670 



206 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3097>: 

m206 .seq 
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1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAr AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

3 01 GCCCGCGACA TGGCGGCGGC AAG CCG s AAA ATCCCCGAcA GCCGCyTCAA 
351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 
451 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 
501 CTACCTCGGC GCACATACTT TTTTTACAGA ATGA 



This corresponds to the amino acid sequence <SEQ ID 3098; ORF 206>: 
m206 .pep. . 

1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 



51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 
101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 
151 GKTI KTEKLS TPFYAKNYLG AHTFFTE* 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3099>: 



1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggactca tcggcacgcc 

201 ctacaaatgg ggcggcagca gcaccgcaac cggc&tcgac tgcagcggca 

251 tgattcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

301 gcccgcgaca tggcggcggc aagccgcaaa atccccgaca gccgcctcaa 

3 51 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctactcac 

4 01 acgtcggact ctacatcggc aacggcgaat tcatccatgc ccccggcagc 
451 ggcaaaacca tcaaaaccga aaaactctcc acaccgtttt acgccaaaaa 
501 ctaccttgga gcgcatacgt tttttacaga atga 



This corresponds to the amino acid sequence <SEQ ID 3100; ORF 206.ng>: 

g206 .pep 

1 MFSPDKTLFL CLGALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIGRT 



51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 
101 ARDMAAASRK IPDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS 
151 GKTI KTEKLS TPFYAKNYLG AHTFFTE* 



ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from N. gonorrhoeae: 

m206/g206 



g206 . seq 



m206 .pep 



10 20 30 40 50 60 

MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 



g206 




10 20 30 40 50 60 



m206 -pep 



70 80 90 100 110 120 

LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 



g206 




70 80 90 100 110 120 



m206 -pep 



130 140 150 160 170 

LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 



g206 
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130 140 150 160 170 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3101>: 

a206 . seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

2 01 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

4 51 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACG CCAAAAA 

501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3102; ORF 206.a>: 

a206.pep 

1 MFPPDK TLFL CLSALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQEVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 ORFs 206 and 206.a showed a 99.4% identity in 1 77 aa overlap 

10 20 30 40 50 60 

m206 . pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 
M I I I t M I I It I I ! U 11 II M I I M 1 I I ! I M M I I I M I M I II I M i I I I II M M 
a2 06 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 06 . pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTT^RDMAAASRKIPDSRXKAGD 
i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I M I I I I I I I I I I I MM 
a206 LGL I GT P YKWGG S ST ATGFDCSGM I QFVYKNALNVKLPRT ARDMAAASRK IPDSRLKAGD 

70 80 90 100 110 120 

130 140 150 160 170 

m206 . pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
I M M II I I II II M II I M M M M M I II I II II II II II I M M I M M M M II 
a 206 LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

130 140 150 160 170 



287 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 3103>: 

m287 . seq 

1 ATGTTTAAAC GCAGCGTAAT CGCAATGGCT TGTATTTTTG CCCTTTCAGC 

51 CTGCGGGGGC GGCGGTGGCG GATCGCCCGA TGTCAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTTCTG AAAAAGAGAC AG AGG CAAAG 

151 GAAGATGCGC CACAGGCAGG TTCTCAAGGA CAGGGCGCGC CATCCGCACA 

201 AGGCAGTCAA GATATGGCGG CGGTTTCGGA AGAAAATACA GGCAATGGCG 

251 GTGCGGTAAC AGCGGATAAT CCCAAAAATG AAGACGAGGT GGCACAAAAT 

301 GATATGCCGC AAAATGCCGC CGGTACAGAT AGTTCGACAC CGAATCACAC 

351 CCCGGATCCG AATATGCTTG CCGGAAATAT GGAAAATCAA GCAACGGATG 

4 01 CCGGGGAATC GTCTCAGCCG GCAAACCAAC CGGATATGGC AAATGCGGCG 

4 51 GACGGAATGC AGGGGGACGA TCCGTCGGCA GGCGGGCAAA ATGCCGGCAA 

501 TACGGCTGCC CAAGGTGCAA ATCAAGCCGG AAACAATCAA GCCGCCGGTT 

551 CTTCAGATCC CATCCCCGCG TCAAACCCTG CACCTGCGAA TGGCGGTAGC 
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601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 



AATTTTGGAA 
GCAAAATATA 
ATTTCTTGGA 
GATGCAGACA 
TGTCGGTTTG 
TTATCTTTTA 
GCACGGTCGA 
TCAGGCGGAT 
ATTCCGGCAA 
GGGGCGGAAA 
ACCGGCAAAA 
TACT G CAT TT 
TTTGCCGCAA 
CAGCGGCGAT 
ATGGAAACGG 
TCCGGAAAGT 
CTATCGCCCG 
AAAAAGAGCA 



GGGTTGATTT 
ACGTTGACCC 
TGAAGAAGTA 
AAATAAGTAA 
GTTGCCGATA 
TAAACCTAAA 
GGCGGTCGCT 
ACGCTGATTG 
TATCTTCGCG 
AATTGCCCGG 
GGCGAAATGC 
CCATACGGAA 
AAGTCGATTT 
GATTTGCATA 
CTTTAAGGGG 
TTTACGGCCC 
ACAGATGCGG 
GGATTGA 



GGCTAATGGC 
ACTGTAAAGG 
CAGCTAAAAT 
TTACAAGAAA 
GTGTGCAGAT 
CCCACTTCAT 
TCCGGCCGAG 
TCGATGGGGA 
CCCGAAGGGA 
CGGATCGTAT 
TTGCGGGCGC 
AACGGCCGTC 
CGGCAGCAAA 
TGGGTACGCA 
ACTTGGACGG 
GGCCGGCGAG 
AAAAGGGCGG 



GTTTTGATTG 
CGATTCTTGT 
CAGAATTTGA 
GATGGGAAGA 
GAAGGGAATC 
TTGCGCGATT 
ATGCCGCTGA 
AGCGGTCAGC 
ATTACCGGTA 
GCCCTTCGTG 
GGCCGTGTAC 
CGTACCCGAC 
TCTGTGGACG 
AAAATTCAAA 
AAAATGGCAG 
GAAGTGGCGG 
ATTCGGCGTG 



ACGGGCCGTC 
AGTGGCAATA 
AAAATTAAGT 
ATGATAAATT 
AATCAATATA 
TAGGCGTTCT 
TTCCCGTCAA 
CTGACGGGGC 
TCTGACTTAC 
TTCAAGGCGA 
AACGGCGAAG 
CAGGGGCAGG 
GCATTATCGA 
GCCGCCATCG 
CGGGGATGTT 
GAAAATACAG 
TTTGCCGGCA 



This corresponds to the amino acid sequence <SEQ ID 3104; ORF 287>: 

m287 -pep 

1 MFKRSVIAMA CIFALSA CGG GGGGSPDVKS ADTLSKPAAP WSEKETEAK 

51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 

101 DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA 

151 DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 

201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 

251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS 

301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 

351 GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR 

401 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV 

4 51 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 



The following partial DNA sequence was 

g287.seq 

1 atgtttaaac gcagtgtgat 

51 ctgtgggggc ggcggtggcg 

101 cgtcaaaacc ggccgccccc 

151 ctgccgaaag aaaagaaaga 

201 cgatacgcag gacgcaaccg 

251 tttcggcaga aaatacaggc 

301 aaaaatgaag acgcgggggc 

351 atccgcaaat caaacaggga 

401 cccccgcgtc aaaccctgcc 

451 acgaacgtgg gcaattctgt 

501 gttgacccac tgtaaaggcg 

551 aagaagcacc gtcaaaatca 

601 attaagcgat ataaaaaaga 

651 tgctgacagg gtaaaaaagg 

701 cggacaaacc acctactcgt 

751 gagattccgc tgattcccgt 

801 ggaagcggtc agcctgacgg 

851 ggaattaccg gtatctgact 

901 tatgccctcc gtgtgcaagg 

951 cacggccgtg tacaacggcg 

1001 gtccgtaccc gtccggaggc 

1051 aaatctgtgg acggcattat 

1101 gcaaaaattc aaagccgcca 

1151 cggaaaatgg cggcggggat 

1201 gaggaagtgg cgggaaaata 

1251 cggattcggc gtgtttgccg 



identified in N. gonorrhoeae <SEQ ID 3105>: 



tgcaatggct 
gatcgcccga 
gttgttgctg 
tgaggaggca 
ccggagaagg 
aatggcggtg 
gcaaaatgat 
acaaccaacc 
cctgcgaatg 
tgtgattgac 
attcttgtaa 
gaatttgaaa 
cgagcaacgg 
atggaactaa 
tctgcacggt 
caatcaggcc 
ggcattccgg 
tacggggcgg 
cgaaccggca 
aagtgctgca 
aggtttgccg 
cgacagcggc 
tcgatggaaa 
gtttccggaa 
cagctatcgc 
gcaaaaaaga 



tgtatttttc 
tgtcaagtcg 
aaaatgccgg 
gcgggcggtg 
cagccaagat 
cggcaacaac 
atgccgcaaa 
cgccggttct 
gcggtagcga 
ggaccgtcgc 
tggtgataat 
aattaagtga 
gagaattttg 
caaatatatc 
cgaggaggtc 
gatacgctga 
caatatcttc 
aaaaattgcc 
aaaggcgaaa 
tttccatatg 
caaaagtcga 
gatgatttgc 
cggctttaag 
ggttttacgg 
ccgacagatg 
tcgggattga 



ccctttcagc 
gcggacacgc 
ggaaggggtg 
cgccgcaagc 
atggcggcag 
ggacaacccc 
atgccgccga 
tcagattccg 
ttttggaagg 
aaaatataac 
ttattggatg 
tgaagaaaaa 
tcggtttggt 
atcttctata 
gcttccggcc 
ttgtggatgg 
gcgcccgaag 
cggcggatcg 
tgcttgttgg 
gaaaacggcc 
tttcggcagc 
atatgggtac 
gggacttgga 
cccggccggc 
ctgaaaaggg 



This corresponds to the amino acid sequence <SEQ ID 3106; ORF 287.ng>: 

g287 .pep 

1 MFKRSVIAMA CIFPLSA CGG GGGGSPDVKS ADTPSKPAAP VVAENAGEGV 
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51 LPKEKKDEEA AGGAPQADTQ DATAGEGSQD MAAVSAENTG NGGAATTDNP 

101 KNEDAGAQND MPQNAAESAN QTGNNQPAGS SDSAPASNPA PANGGSDFGR 

151 TOVGNSVVID GPSQNITLTH CKGDSCNGDN LLDEEAPSKS EFEKLSDEEK 

201 IKRYKKDEQR EN FVGLVADR VKKDGTNKYI IFYTDKPPTR SARSRRSLPA 

251 EIPLIPVNQA DTLIVDGEAV SLTGHSGNIF APEGNYRYLT YGAEKLPGGS 

301 YALRVQGEPA KGEMLVGTAV YNGEVLHFHM ENGRPYPSGG RFAAKVDFGS 

351 KSVDGIIDSG DDLHMGTQKF KAAIDGNGFK GTWTENGGGD VSGRFYGPAG 

4 01 EEVAGKYSYR PTDAEKGGFG VFAGKKDRD* 



m287/g287 ORFs 287 and 287.ng showed a 70.1% identity in 499 aa overlap 

10 20 30 40 49 

m287 .pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

I M I i I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I I I I! I : I I : II 

g 287 MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPVVAENAGEGVLPKEKKDEEA 

10 20 30 40 50 60 



50 60 70 80 90 100 109 

m2 87 . pep KEDAPQAGSQGQGAPSAQGSQDHAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNT^AGT 

MM : I I : : : M I II I I M M II I M I : I : II II I I I I II M M I M 
g287 AGGAPQADTQD— ATAGEGSQDMAAVSAENTGNGGAATTDWPKNEDAGAQNDMPQNAA— 

70 80 90 100 110 

110 120 130 140 150 160 169 

m287 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 

g287 



170 180 190 200 210 220 229 

m287 . pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 
: : I I I : M M Mill If I M M M I M : I M : : : : I : I : I I I M M M M I M M I 
g 287 -ESANQTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDS 
120 130 140 150 160 170 



230 240 250 260 270 280 289 

m287 . pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

I : I : I : I I I I : M I M I I M Ml: MM : : : I I I M M I : I I M M M 
g287 CNGDNLLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTD 
180 190 200 210 220 230 



290 300 310 320 330 340 349 

m287 . pep KPTSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 
II : M I I I II I I II I M II I I I II I II I M It I I I I I I I M I (I I II I M II I I 

g287 KPPT RSARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 

240 250 260 270 280 290 



350 360 370 380 390 400 409 

m287 . pep YGAEKLPGGS YALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGS 

II II 1 I I I M I I II I I I M I I I I I I :\ M I II I I I I I I! I I II II I : I I I I I I M I I I 
g 2 8 7 YGAEKLPGGS YALRVQGEP AKGEMLVGT AVYNGEVLHFHMENGRP YPSGGRFAAKVDFGS 

300 310 320 330 340 350 



410 420 430 440 450 460 469 

m287 . pep KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYR 
I I II I I I I I II I M II I I I I I II II I I I II M II II I M I I M M I M I I I I I II II I I I 
g287 KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYR 

360 370 380 390 400 410 



m287 .pep 



470 480 489 

PTDAEKGGFGVFAGKKEQDX 
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I M II I I I If I I I I I I : : I I 
g 2 8 7 PT DAE KG G FG V FAGKK DRDX 

420 430 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3107>: 

a287.seq 

1 ATGTTTAAAC GCAGTGTGAT TGCAATGGCT TGTATTGTTG CCCTTTCAGC 

51 CTGTGGGGGC GGCGGTGGCG GATCGCCCGA TGTTAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTACTG AAGATGTCGG GGAAGAGGTG 

151 CTGCCGAAAG AAAAGAAAGA TGAGGAGGCG GTGAGTGGTG CGCCGCAAGC 

201 CGATACGCAG GACGCAACCG CCGGAAAAGG CGGTCAAGAT ATGGCGGCAG 

251 TTTCGGCAGA AAATACAGGC AATGGCGGTG CGGCAACAAC GGATAATCCC 

301 GAAAATAAAG ACGAGGGACC GCAAAATGAT ATGCCGCAAA ATGCCGCCGA 

351 T AC AG AT AG T TCGACACCGA ATCACACCCC TGCACCGAAT ATGCCAACCA 

401 GAGATATGGG AAACCAAGCA CCGGATGCCG GGGAATCGGC ACAACCGGCA 

4 51 AACCAACCGG AT AT GGC AAA TGCGGCGGAC GGAATGCAGG GGGACGATCC 

501 GTCGGCAGGG GAAAATGCCG GCAATACGGC AGATCAAGCT GCAAATCAAG 

551 CTGAAAACAA TCAAGTCGGC GGCTCTCAAA ATCCTGCCTC TTCAACCAAT 

601 CCTAACGCCA CGAATGGCGG CAGCGATTTT GGAAGGATAA ATGTAGCTAA 

651 TGGCATCAAG CTTGACAGCG GTTCGGAAAA TGTAACGTTG ACACATTGTA 

701 AAGACAAAGT ATGCGATAGA GATTTCTTAG ATGAAGAAGC ACCACCAAAA 

7 51 TCAGAATTTG AAAAATTAAG TGATGAAGAA AAAATTAATA AATATAAAAA 

801 AGACGAGCAA CGAGAGAATT TTGTCGGTTT GGTTGCTGAC AGGGTAGAAA 

851 AGAATGGAAC TAACAAATAT GTCATCATTT ATAAAGACAA GTCCGCTTCA 

901 TCTTCATCTG CGCGATTCAG GCGTTCTGCA CGGTCGAGGC GGTCGCTTCC 

951 GGCCGAGATG CCGCTGATTC CCGTCAATCA GGCGGATACG CTGATTGTCG 

1001 ATGGGGAAGC GGTCAGCCTG ACGGGGCATT CCGGCAATAT CTTCGCGCCC 

1051 GAAGGGAATT ACCGGTATCT GACTTACGGG GCGGAAAAAT TGTCCGGCGG 

1101 ATCGTATGCC CTCAGTGTGC AAGGCGAACC GGCAAAAGGC GAAATGCTTG 

1151 CGGGCACGGC CGTGTACAAC GGCGAAGTGC TGCATTTCCA TATGGAAAAC 

1201 GGCCGTCCGT CCCCGTCCGG AGGCAGGTTT GCCGCAAAAG TCGATTTCGG 

1251 CAGCAAATCT GTGGACGGCA TTATCGACAG CGGCGATGAT TTGCATATGG 

1301 GTACGCAAAA ATTCAAAGCC GTTATCGATG GAAACGGCTT TAAGGGGACT 

1351 TGGACGGAAA ATGGCGGCGG GGATGTTTCC GGAAGGTTTT ACGGCCCGGC 

14 01 CGGCGAAGAA GTGGCGGGAA AATACAGCTA TCGCCCGACA GATGCGGAAA 

14 51 AGGGCGGATT CGGCGTGTTT GCCGGCAAAA AAGAGCAGGA TTGA 

This corresponds to the amino acid sequence <SEQ ID 3108; ORF 287. a>: 

a287 .pep 

1 MFKRSVIAMA CIVALSA CGG GGGGSPDVKS ADTLSKPAAP WTEDVGEEV 

51 LPKEKKDEEA VSGAPQADTQ DATAGKGGQD MAAVSAENTG NGGAATTDNP 

101 ENKDEGPQND MPQNAADTDS STPNHTPAPN MPTRDMGNQA PDAGESAQPA 

151 NQPDMANAAD GMQGDDPSAG ENAGNTADQA ANQAENNQVG GSQNPASSTN 

201 PNATNGGSDF GRIN VANG IK LDSGSENVTL THCKDKVCDR DFLDEEAPPK 

251 SEFEKLSDEE KINKYKKDEQ RENFVGLVAD RVEKNGTNKY VIIYKDKSAS 

301 SSSARFRRSA RSRRSLPAEM PLIPVNQADT LIVDGEAVSL TGHSGNIFAP 

351 EGNYRYLTYG AEKLSGGSYA LSVQGEPAKG EMLAGTAVYN GEVLHFHMEN 

4 01 GRPSPSGGRF AAKVDFGSKS VDGIIDSGDD LHMGTQKFKA VIDGNGFKGT 

4 51 WTENGGGDVS GRFYGPAGEE VAGKYSYRPT DAEKGGFGVF AGKKEQD* 

m287/a287 ORFs 287 and 287. a showed a 77.2% identity in 501 aa overlap 

10 20 30 40 49 

m287 . pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

I M I II I M 1 1 I I I I I I I I I I I I I I I I i I I 1 I 1 1 1 1 I I 11 I : I | : | ) 

a287 MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPVVTEDVGEEVLPKEKKDEEA 

10 20 30 40 50 60 

50 60 70 80 90 100 109 

m287 . pep KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT ■ 

UN : I I : : : I : I I I I I I I I I I I I I I I : I : II I : I : I I I I I M I I II I 
a287 VSGAPQADTQ — DATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADT 

70 80 90 100 110 
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110 120 130 140 150 160 169 

m287 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGHQGDDPSAGGQNAGNTA 
MINIMI Ml : : I III M I It : I I M I M M I M I II I I M I I M M I M M 
a 2 87 DSSTPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAG-ENAGNTA 
120 130 140 150 160 170 



170 180 190 200 210 220 229 

m287 . pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 

I : M M I I I : M I : : I : : M : I I M : M I : : : I M : : I : I : I : I M M I 
a 28 7 DQAANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKV 
180 190 200 210 220 230 



230 240 250 260 270 280 289 

m287 . pep CSGNNFLDEEVQLKSEFEKLSDADK1SNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

I : : M II 1 : MINIMI M M : I M I : : : I M M I I I : : I I : M M I I 
a 287 CD-RDFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVIIYKD 
240 250 260 270 280 290 



290 300 310 320 330 340 

m287 . pep KP — TSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 

I : I M I I M M M M M M I II M I I M M I M M I M M I M M M M I M M M 
a 287 KSASSSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 
300 310 320 330 340 350 



350 360 370 380 390 400 

m287 . pep LTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDF 

M I It M 1 I) M M M M M M I M I I : M M M M II I Mill 1 : II M M M I 
a287 LTYGAEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDF 
360 370 380 390 400 410 



410 420 430 440 450 460 

m287 . pep GSKSVDGIIDSGDDLHMGTQKFKAAI DGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYS 

1 I I M II M M M I M I M M I M M M M M I M M M M M I I : I M 1 M I M M II I 
a287 GSKSVDGI I DSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYS 

420 430 440 450 460 470 



470 480 489 

m2 8 7 . pep YRPTDAEKGGFGVFAGKKEQDX 

I I I I II M I I I II M I I M M I 
a2 87 YRPTDAEKGGFGVFAGKKEQDX 
480 490 



406 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3109>: 

m406 . seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TTGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGG AT T TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 CACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

3 01 GATTACACCT AT CCACGT TA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CTCGCACCCA AT C AGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC AC CT ATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 
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701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 

801 AGGAATTAAA CCGACGGAAG GATTAATGGT CGATTTCTCC GATATCCGAC 

851 CATACGGCAA TCATACGGGT AACTCCGCCC CAT C CGTAG A GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GTAGTGCGAC AACATAGACA 

951 AGGACAACCT TGA 



This corresponds to the amino acid sequence <SEQ ID 31 10; ORF 406>: 

m406 .pep 

1 MQARLLIPIL F5VFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIRPYGNHTG NSAPSVEADN 

3 01 SHEGYGYSDE WRQHRQGQP * 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 31 1 1>: 

g406 . seq 

1 ATGCGGGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGCAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC T GCCGTT AAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

2 51 TTGATGCACT GATTCGCGGC GAATACATAA ACAGCCCTGC CGTCCGCACC 

3 01 GATTACACCT ATCCGCGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 
351 TTTGACGGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 
401 CGCGCACCCA ATCAGACGGT AGCGGAAGTA GGAGCAGTCT GGGCTTAAAT 
451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CCAACCCGCG 
501 CGACACTGCC TTTCTTTCCC ACTTGGTGCA GACCGTATTT TTCCTGCGCG 
551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 
601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 
651 TGCCGAAACA CTGAAAG CCC AAACAAAACT GGAATATTTC GCAGTAGACA 
701 GAACCAATAA AAAATTGCTC ATCAAAC C C A AAACCAATGC GTTTGAAGCT 
751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 
801 AGGAATCAAA CCGACGGAAG GATTGATGGT CGATTTCTCC GATATCCAAC 
851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 
901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC AACATAGACA 
951 AGGGCAACCT TGA 



This corresponds to the amino acid sequence <SEQ ID 31 12; ORF 406>: 

g4G6 .pep 

1 MRARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSRSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHTG NSAPSVEADN 

3 01 SHEGYGYSDE AVRQHRQGQP * 



ORF 406 shows 98.8% identity over a 320 aa overlap with a predicted ORF (ORF406.a) from 
N. gonorrhoeae: 

g406/m406 

10 20 30 40 50 60 

g4 06 . pep MRARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 

hlllMMIIIIIMUMMIMIIMIIIIMIIMIIItlllMMIMIMIMI 

m4 0 6 MQARLLI P I LFSVFI LSACGTLTG I PSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 
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10 20 30 40 50 60 

70 80 90 100 110 120 

g4 06 .pep KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

Mi I MMMM II IM II II MM II Ml II IMMM I II II MM MIIIMM MM II M 

m4 06 KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

70 80 90 100 110 120 

130 140 150 160 170 180 

g4 06 .pep LTTSLSTLNAPALSRTQSDGSGSRSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

III IMMIIMM I I MINIMUM Mil MMMMII KM II IN MINIMI 

m4 06 LTT5LSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

130 140 150 160 170 180 

190 200 210 220 230 240 

g4 0 6 . pep FLRG I DWS PANADTDVFIN1 DVFGTI RNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

I M N 1 1 N M M 1 1 M M I M M I M M 1 1 M I M 1 1 1 1 M M M I M M M M I M M 

m4 06 FLRG I DWS PANADTDVF IN I DVFGT I RNRTEMHLYWAE TLKAQTKLE YFAVDRTNKKLL 

190 200 210 220 230 240 

250 260 270 280 290 300 

g4 06 .pep I KPKTNAFEAAYKENYALWMGPYKVS KGI KPTEGLMVDFSD I QP YGNHTGNS APS VEADN 

1 1 1 M M I M 1 1 1 M t M I i 1 1 1 1 1 MMMMIMMI Ml MMMMMIMMIM 

m4 0 6 I KPKTNAFEAAYKENYALWMGP YKVSKGI KPTEGLMVDFSDI RP YGNHTGNS APS VEADN 

250 260 270 280 290 300 

310 320 
g4 0 6 . pep SHEGYGYSDEAVRQHRQGQPX 

1 1 1 M 1 1 1 1 M M 1 1 I 1 1 1 1 I 

m4 06 SHEGYGYSDEVWQHRQGQPX 

310 320 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3 1 13>: 

a406. seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 G AC AT GG AT T TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

301 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

4 01 CGCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACGGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 

701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

7 51 GCCTATAAAG AAAATTACGC ATTGTGGATG GGACCGTATA AAGTAAGCAA 

801 AG G AAT T AAA CCGACAGAAG GATTAATGGT CGATTTCTCC GATATCCAAC 

851 CATACGGCAA TCATATGGGT AACTCTGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC GACATAGACA 

951 AGGGCAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 3 1 14; ORF 406.a>: 

a4 06.pep 

1 MQARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRG I DWS P ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTOAFEA 
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251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHMG NSAPSVEADN 
301 SHEGYGYSDE AVRRHRQGQP * 

m406/a406 ORFs 406 and 406. a showed a 98.8% identity in 320 aa overlap 

10 20 30 40 50 60 

m4 06.pep MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 
M I I I I I I I I I M I I I I I I I I I M I I I I t I I I I I II I I I I I M It I M I I I M I I I I I I I 
a4 06 MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m406 . pep KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 
I I I I M I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I M I I I I I I I I II I I I I M 
a4 0 6 KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m4 06.pep LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 
M I I I I I I I I I I II I I I II I I I I I I 1 I II I I I I I I M I I I I I I (I I I I I I I I I 1 I I M I I 
a4 06 LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

130 140 150 160 170 180 

190 200 210 220 230 240 

m4 0 6 . pep FLRG I DVVS PANADTDVFINI DVFGT IRNRTEMHL YNAETLKAQTKLE Y FAVDRTNKKLL 

I I M I I I II I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I M 1 I I I M 
a406 FLRGIDWS PANADTDVFINI DVFGT IRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m4 0 6 . pep IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 
I ! M III I I I I I I I II M M IK ! 1 I ! II M I II 111 t M I t : I I I M I I I II I I I I I I 
a 4 06 IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIQPYGNHMGNSAPSVEADN 

250 260 270 280 290 300 

310 320 
m4 0 6 . pep SHEGYGYS DE WRQHRQGQPX 

I I I I I I I I If : I I : I I I I I I I 
a 4 0 6 SHEGYGYS DE AVRRHRQGQPX 

310 320 



EXAMPLE 2 
Expression of ORF 919 
The primer described in Table 1 for ORF 919 was used to locate and clone ORF 919. 
The predicted gene 919 was cloned in pET vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 919-His fusion protein purification. Mice were immunized with the purified 919- 
His and sera were used for Western blot (panel B), FACS analysis (panel C), bactericidal 
assay (panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; PP, 
purified protein, TP, K meningitidis total protein extract; OMV, N. meningitidis outer 
membrane vesicle preparation. Arrows indicate the position of the main recombinant protein 
product (A) and the AT. meningitidis immunoreactive band (B). These experiments confirm 
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that 919 is a surface-exposed protein and that it is a useful immunogen. The hydrophilicity 
plots, antigenic index, and amphipatic regions of ORF 919 are provided in Figure 10. The 
AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 
143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, 
Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 919 and the amino acid 
sequence encoded thereby is provided in Example 1 . 

EXAMPLE 3 
Expression of ORF 279 
The primer described in Table 1 for ORF 279 was used to locate and clone ORF 279. 
The predicted gene 279 was cloned in pGex vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 279-GST purification. Mice were immunized with the purified 279-GST and sera 
were used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay 
(panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. 
meningitidis total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. 
Arrows indicate the position of the main recombinant protein product (A) and the jV. 
meningitidis immunoreactive band (B). These experiments confirm that 279 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 279 are provided in Figure 1 1 . The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 279 and the amino acid sequence encoded thereby is 
provided in Example 1 . 

EXAMPLE 4 
Expression of ORF 576 and 576-1 
The primer described in Table 1 for ORF 576 was used to locate and clone ORF 576. 
The predicted gene 576 was cloned in pGex vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 576- 
GST fusion protein purification. Mice were immunized with the purified 576-GST and sera 
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were used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), 
and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. meningitidis 
total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. Arrows 
indicate the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B).. These experiments confirm that ORF 576 is a surface-exposed 
protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and 
amphipatic regions of ORF 576 are provided in Figure 12. The AMPHI program is used to 
predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi etal. 1992, Scand J Immunol Suppl 11:9). 
The nucleic acid sequence of ORF 576 and the amino acid sequence encoded thereby is 
provided in Example 1. 

EXAMPLE 5 
Expression of ORF 519 and 519-1 
The primer described in Table 1 for ORF 519 was used to locate and clone ORF 519. 
The predicted gene 519 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 519- 
His fusion protein purification. Mice were immunized with the purified 5 19-His and sera were 
used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), and 
ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. meningitidis total 
protein extract; OMV, N. meningitidis outer membrane vesicle preparation. Arrows indicate 
the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B). These experiments confirm that 519 is a surface-exposed protein 
and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and amphipatic 
regions of ORF 519 are provided in Figure 13. The AMPHI program is used to predict 
putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, AIDS Res 
Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1 :9). The nucleic 
acid sequence of ORF 519 and the amino acid sequence encoded thereby is provided in 
Example 1. 
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EXAMPLE 6 
Expression of ORF 121 and 121-1 
The primer described in Table 1 for ORF 121 was used to locate and clone ORF 121. 
The predicted gene 121 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 121 - 
His fusion protein purification. Mice were immunized with the purified 121 -His and sera were 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 121 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, N. meningitidis total protein extract; OMV, K 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 121 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 121 are provided in 
Figure 14. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 121 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 7 
Expression of ORF 128 and 128-1 
The primer described in Table 1 for ORF 128 was used to locate and clone ORF 128. 
The predicted gene 128 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 128- 
His purification. Mice were immunized with the purified 128-His and sera were used for 
Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel D) and 
ELISA assay (panel E). Results show that 128 is a surface-exposed protein. Symbols: Ml, 
molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. meningitidis 
outer membrane vesicle preparation. Arrows indicate the position of the main recombinant 
protein product (A) and the N. meningitidis immunoreactive band (B). These experiments 
confirm that 128 is a surface-exposed protein and that it is a useful immunogen. The 
hydrophilicity plots, antigenic index, and amphipatic regions of ORF 128 are provided in 
Figure 15. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
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Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 128 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 8 
Expression of ORF 206 
The primer described in Table 1 for ORF 206 was used to locate and clone ORF 206. 
The predicted gene 206 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 206- 
His purification. Mice were immunized with the purified 206-His and sera were used for 
Western blot analysis (panel B). It is worthnoting that the immunoreactive band in protein 
extracts from meningococcus is 38 kDa instead of 17 kDa (panel A). To gain information on 
the nature of this antibody staining we expressed ORF 206 in E. coli without the His-tag and 
including the predicted leader peptide. Western blot analysis on total protein extracts from E. 
coli expressing this native form of the 206 protein showed a recative band at a position of 38 
kDa, as observed in meningococcus. We conclude that the 38 kDa band in panel B) is specific 
and that anti-206 antibodies, likely recognize a multimeric protein complex. In panel C is 
shown the FACS analysis, in panel D the bactericidal assay, and in panel E) the ELISA assay. 
Results show that 206 is a surface-exposed protein. Symbols: Ml, molecular weight marker; 
TP, N. meningitidis total protein extract; OMV, N. meningitidis outer membrane vesicle 
preparation. Arrows indicate the position of the main recombinant protein product (A) and the 
N, meningitidis immunoreactive band (B). These experiments confirm that 206 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 519 are provided in Figure 16. The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 206 and the amino acid sequence encoded thereby is 
provided in Example 1 . 
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EXAMPLE 9 
Expression of ORF 287 
The primer described in Table 1 for ORF 287 was used to locate and clone ORF 287. 
The predicted gene 287 was cloned in pGex vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 287- 
GST fusion protein purification. Mice were immunized with the purified 287-GST and sera 
were used for FACS analysis (panel B), bactericidal assay (panel C), and ELISA assay (panel 
D). Results show that 287 is a surface-exposed protein. Symbols: Ml, molecular weight 
marker. Arrow indicates the position of the main recombinant protein product (A). These 
experiments confirm that 287 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 287 are provided in 
Figure 17. The AMPHI program is used to predict putative T-cell epitopes (Gao et ai 1989, J. 
Immunol 143:3007; Roberts et aL 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 287 and the amino 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLE 10 
Expression of ORF 406 
The primer described in Table 1 for ORF 406 was used to locate and clone ORF 406. 
The predicted gene 406 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 406- 
His fusion protein purification. Mice were immunized with the purified 406-His and sera were 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 406 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 406 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 406 are provided in 
Figure 18. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et aL 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
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1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 406 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 11 



Table 2 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 225 among different strains. 
Table 2 



225 gene variability: List of used Neisseria strains 


Identification Strains 


Source / reference 


number 






Group B 




zoOl 225 


NG6/88 


R. Moxon / Seiler et al., 1996 


zo02 225 


BZ198 


xv. iviuAuii / ociici ei at., i7"o 


zo03 225 


NG3/88 


R. Moxon / Seiler et al, 1996 


zo04_225 


297-0 


R. Moxon / Seiler et al, 1996 


zo05 225 


1000 


R. Moxon / Seiler et al, 1996 


zo06 225 


BZ147 


R. Moxon / Seiler et al, 1996 


zo07_225 


BZ169 


xv. lvjujAvJii / ociici ei at., Lyyo 


zo08 225 


528 


R. Moxon / Seiler et al, 1996 


zo09 225 


NGP165 


R. Moxon / Seiler et al, 1996 


zolO 225 


BZ133 


R. Moxon / Seiler et ah, 1996 


zoll 225 


NGE31 


R. Moxon / Seller et al. 9 1996 


zol2 225 


NGF26 


R. Moxon / Seiler et al., 1996 


zol3 225 


NGE28 


R. Moxon / Seiler et al, 1996 


zol4 225 


NGH38 


R. Moxon / Seiler et al, 1996 


zol5 225 


SWZ107 


R. Moxon / Seiler et al, 1996 


zol6 225 


NGH15 


R. Moxon / Seiler et al, 1996 


zol7 225 


NGH36 


R. Moxon / Seiler et al, 1996 


zol8 225 


BZ232 


R. Moxon / Seiler et al, 1996 


zol9 225 


BZ83 


R. Moxon / Seiler et al, 1996 


zo20_225 


44/76 


R. Moxon / Seiler et al, 1996 


zo21 225 


MC58 


R. Moxon 


zo96 225 


2996 


Our collection 


Group A 




zo22 225 


205900 


R. Moxon 


zo23 225 


F6124 


R. Moxon 


z2491 


Z2491 


R. Moxon / Maiden et al, 1998 


Group C 




zo24 225 


90/18311 


R. Moxon 


zo25_225 


93/4286 


R. Moxon 
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Others 

zo26_225 A22 (group W) R. Moxon / Maiden et al., 1998 
zo27_225 E26 (group X) R. Moxon / Maiden et aL, 1998 
zo28_225 860800 (group Y) R. Moxon / Maiden et aL, 1998 
zo29_225 E32 (group Z) R. Moxon / Maiden et al y 1998 

Gonococcus 

zo32_225 Ng F62 R. Moxon / Maiden a/., 1998 

zo33_225 Ng SN4 R. Moxon 

fal090 FA1090 R. Moxon 



References: 

Seiler A. et aL 9 MoL Microbiol., 1996, 19(4):841-856. 
Maiden et aL, Proc. NatL Acad. ScL USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 

>FA1090 <SEQ ID 3115> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
R F I H APRTGKN I E I T S LS HKY W SGKY AFARR VKKN DP SRFLN * 

Z2491 <SEQ ID 3116> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSISTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARR VKKNDPSRFLN* 

ZO01_225 <SEQ ID 3117> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO02_22S <SEQ ID 3118> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO03_225 <SEQ ID 3119> 

MDSFFKPAVWAVLWLMFAVRLALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 
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ZO04_225 <SEQ ID 3120> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO05_225 <SEQ ID 3121> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTS I STGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

2O06_225 <SEQ ID 3122> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DE LIGNAMGLLG I AYRYGGTSVSTGFDCSGFMQH I FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO07_225 <SEQ ID 3123> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO08_225 <SEQ ID 3124> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 
DEL I GNAMGLLG I AYR YGGT S I S TG FDC S G FMQH I FKRAMG I N L PRT S AEQARMGT P VAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO09_225 <SEQ ID 3125> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTS I STGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO10_225 <SEQ ID 3126> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHI FKRAMG INLPRT SAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZOH_225 <SEQ ID 3127> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z012_225 <SEQ ID 3128> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLG I AYR YGGT SI STGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z013 225 <SEQ ID 3129> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFIQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z014_225 <SEQ ID 3130> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z015_225 <SEQ ID 3131> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

ZO!6_225 <SEQ ID 3132> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z017_225 <SEQ ID 3133> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z018_225 <SEQ ID 3134> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z019_225 <SEQ ID 3135> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO20_225 <SEQ ID 3136> 

MDSFFKPAWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPINRAPARRAGNADELIGSAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIE ITS LSHKYWSGKYAFARR VKKNDPSRFLN* 

Z021_225 <SEQ ID 3137> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z022_225 <SEQ ID 3138> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGI AYRYGGTS I STGFDCSGFMQHI FKRAMGINLPRTS AEQARMGT PVAR 
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SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z023_225 <SEQ ID 3139> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQFAE DEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z024_225 <SEQ ID 3140> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQFAE DEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z025_225 <SEQ ID 3141> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQFAE DEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z026_225 <SEQ ID 3142> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQFAE DEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z027_225 <SEQ ID 3143> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z028_225 <SEQ ID 3144> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z029_225 <SEQ ID 3145> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLG IAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFI HAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z032_225 <SEQ ID 3146> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQFAE DEQPVLPVN RAP ARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z033_225 <SEQ ID 3147> 

MDS FFKPAVWAVLWLMFAVRSALADELTNLLS SREQI LRQFAE DEQPVLPVN RAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRIKKNDPSRFLN* 

Z096 225 <SEQ ID 3148> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DEL I GNAMGLLGI AYRYGGTS I STGFDCSGFMQHI FKRAMG INLPRTSAEQARMGT PVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 



Figure 19 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 225, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 12 



Table 3 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 235 among different strains. 



Table 3 



235 gene variability: List of used Neisseria strains 


Identification Strains 


Reference 


number 






Group B 




gnmzqOl 


NG6/88 


Seiler e* <z/., 1996 


gnmzq02 


BZ198 


Seiler a/., 1996 


gnmzq03 


NG3/88 


Seiler etal., 1996 


gnrazq04 


1000 


Seiler etal., 1996 


gnmzq05 


1000 


Seiler etal., 1996 


gnmzq07 


BZ169 


Seiler et al., 1996 


gnmzq08 


528 


Seiler etal., 1996 


gnmzq09 


NGP165 


Seiler et al., 1996 


gnmzqlO 


BZ133 


Seiler et al., 1996 


gnmzqll 


NGE31 


Seiler et al., 1996 


gnmzql3 


NGE28 


Seiler et al., 1996 


gnmzql4 


NGH38 


Seiler et al., 1996 


gnmzqlS 


SWZ107 


Seiler et al., 1996 


gnmzql6 


NGH15 


Seiler et al., 1996 


gnmzql7 


NGH36 


Seiler et al., 1996 


gnmzql8 


BZ232 


Seiler et al, 1996 


gnmzql9 


BZ83 


Seiler et al., 1996 


gnmzq21 


MC58 


Virji etal., 1992 


Gr up A 




gnmzq22 


205900 


Our collection 
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gnmzq23 F6124 
z2491 Z2491 


Our collection 
Maiden et al, 1998 


Group C 

gnmzq24 90/18311 
gnmzq25 93/4286 


Our collection 
Our collection 


Others 

gnmzq26 A22 (group W) Maiden et aL, 1998 
gnmzq27 E26 (group X) Maiden et al. y 1998 
gnmzq28 860800 (group Y) Maiden et ah, 1998 
gnmzq29 E32 (group Z) Maiden et aL, 1998 
gnmzq31 N. lact arnica Our collection 


Gonococcus 

gnmzq32 Ng F62 
gnmzq33 Ng SN4 


Maiden et al., 1998 
Our collection 


fal090 FA1090 


Dempsey et al. 1991 


References: 




Seiler A. etal, Mol. Microbiol., 1996, 19(4): 841 -856. 

Maiden R. et aL 9 Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Virji M. etaL, Mol. Microbiol., 1992, 6:1271-1279 

Dempsey J.F. etaL, J. Bacterid., 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3149> 

MKPLILGLAAVIiALSACQVRKAPDLDYTSFKESKPASILVVPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQ01 <SEQ ID 3150> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLigESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ02 <SEQ ID 3151> 

MKPLIL'GLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ03 <SEQ ID 3152> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
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AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ04 <SEQ ID 3153> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS ILWPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ05 <SEQ ID 3154> 

MKPLILGIAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ07 <SEQ ID 3155> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ08 <SEQ ID 3156> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ09 <SEQ ID 3157> 

MKPL I LGLAAALVLSACQVQKAPDFDYTS FKESKPAS I LWPPLNESPDVNGTWGMLAST 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVQPEKLHQIFGNDAVLYITITEYGTS 
YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRG YQVSKTAAYNLL S PYSHNG I LKG PRFVEEQPK * 

GNMZQ10 <SEQ ID 3158> 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLRQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ11 <SEQ ID 3159> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG I LKGPRFVEEQPK* 

GNMZQ13 <SEQ ID 3160> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS ILWPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ14 <SEQ ID 3161> 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNN SNSGLLGALVGAWNQIANSLT 
DRG YQV SKT AAYN LLSPYS HNGI LKG PRFVEEQPK * 

GNMZQ15 <SEQ ID 3162> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG I LKGPRFVEEQPK* 



GNMZQ16 <SEQ ID 3163> 
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MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ17 <SEQ ID 3164> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILD SVTT V S AKARLVD S RNGKELW SGSASI REG SNNSN SGLLG ALV SAWNQI AN SLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ18 <SEQ ID 3165> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ19 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGFRFVEEQPK* 

GNMZQ21 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ22 <SEQ ID 3167> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ23 <SEQ ID 3168> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ24 <SEQ ID 3169> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ25 <SEQ ID 3170> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ26 <SEQ ID 3171> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGMLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLX.GALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ27 <SEQ ID 3172> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAICARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 
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GNM2Q28 <SEQ ID 3173> 

MKPLILGLAAVLALSACQVQfCAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYXTVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ29 <SEQ ID 3174> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ31 <SEQ ID 317 5> 

MKPLILGLAAVLALSACQVQKAPDFDYTAFKE SKPAS I LWPPLNES PDVNGTWGMLAST 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITITEYGTS 
YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKAAAYDLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ32 <SEQ ID 3176> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKP AS I LWPPLNES PDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQ33 <SEQ ID 3177> 

MKPL I LGLAAVLALSACQVRKAPDLDYTSFKE SKPAS I LWPPLNES PDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

Z2491 <SEQ ID 3178> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 



Figure 20 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 235, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 13 



Table 4 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 287 among different strains. 
Table 4 



287 gene variability: List of used Neisseria strains 

Identification Strains Reference 
number 
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Group B 



287_2 
287_9 
287_14 
287 21 



BZ198 
NGP165 
NGH38 
MC58 



Seller et al, 1996 
Seiler et al, 1996 
Seiler et aL, 1996 
Virji etal, 1992 



z2491 



Group A 

Z2491 



Maiden etal, 1998 



Gonococcus 



fal 090 FA1 090 Dempsey et al 1 99 1 



References: 

Seiler A. et al, Mol. Microbiol., 1996, 19(4):841-856. 

Maiden R. et al, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Viqi M. et al, Mol. Microbiol., 1992, 6:1271-1279 

Dempsey J.F. etal, J. BacterioL, 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



287_14 <SEQ ID 3179> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGAPS AQGGQDMAAVS EENTGNGGAAAT DKPKNE DEGAQNDMPQNAADT D S LT PNHT PAS 
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 
TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR 
FRRSARSRRSLPAEMPLI PVNQADTLIVDGEAVSLTGHSGNI FAPEGNYRYLTYGAEKLP 
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 
DSGDGLHMGTQKFKAAI DGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_2 <SEQ ID 3180> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQNDMPQNAADTDSLTPNHTPAS 
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 
TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNN FLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR 
FRRSARSRRSLPAEMPLI PVNQADTLIVDGEAVSLTGHSGNI FAPEGNYRYLTYGAEKLP 
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 
DSGDGLHMGTQKFKAAI DGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_21. <SEQ ID 3181> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNN FLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRS 
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ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 
ALRVQGE PAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGIIDSGD 
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKKEQD* 

287_9 <SEQ ID 3182> 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPVVTEDVGEEVLPKEKKDEEA 
VSGAPQADTQDATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADTDS 
STPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAGENAGNTADQA 
ANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKVCDR 
DFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVI IYKDKSAS 
SSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYG 
AEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDFGSKS 
VDGIIDSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPT 
DAEKGGFGVFAGKKEQD* 

FA109O <SEQ ID 3183> 

MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPWAENAGEGVLPKEKKDEEA 
AGGAPQADTQDATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAAESAN 
QTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDSCNGDN 
LLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTDKPPTR 
SARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGS 
YALRVQGEPAKGEMLVGTAVYNGEVLH FHMENGRPYPSGGRFAAKVDFGSKSVDGIIDSG 
DDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPTDAEKGGFG 
VFAGKKDRD* 

Z2491 <SEQ ID 3184> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSE FEKLS DADKI SN YKKDGKNDKFVGLVADSVQMKGINQYI I FYKPKPT S FARFRRS 
ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 
ALRVQGE PAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGI I DSGD 
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKKEQD* 



Figure 21 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 287, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 14 



Table 5 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 519 among different strains. 
Table 5 



519 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 
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number 








Group B 




zvOl 


S19 


NG6/RR 

1 N V_J VJ/ OO 


K. Moxon / Seller e/ a/., 1996 


zv02 




R719R 

i~>£~» 170 


K. Moxon / Seller et aL, 1996 


zv03 




1NVJ.J/ OO 


K. Moxon / Seller et aL, 1996 


7v04 




9Q7 0 


K. Moxon / Seller et aL, 1996 


Zi V \J~f 




1UUU 


R. Moxon / Seller aL, 1996 




519ass 


BZ147 


R. Moxon / Seiler et aL, 1996 


zv07 


"519 


BZ169 


R. Moxon / Seiler et aL, 1996 


zvll" 


519 


NGE31 


R. Moxon / Seiler et aL, 1996 


zvl2 


519 


NGF26 


R. Moxon / Seiler et aL, 1996 


zvl8_ 


519 


BZ232 


R. Moxon / Seiler aL, 1996 


zvl9_ 


519 


BZ83 


R. Moxon / Seiler et aL, 1996 


zv20 


_519ass 


44/76 


R. Moxon / Seiler et aL, 1996 


zv21~ 


~519ass 


MC58 


R. Moxon 


zv96 


519 


2996 


Our collection 



Group A 

zv22_519ass 205900 
z2491 519 Z2491 



R. Moxon 
R. Moxon / Maiden et aL, 1998 



Others 

zv26_5 1 9 A22 (group W) R. Moxon / Maiden et aL, 1 998 
zv27_5 1 9 E26 (group X) R. Moxon / Maiden et aL, 1 998 
zv28_519 860800 (group Y) R. Moxon / Maiden et aL, 1998 
zv29_5 1 9ass E32 (group Z) R. Moxon / Maiden et aL, 1 998 



Gonococcus 

zv32_519 Ng F62 

fal090 519 FA1090 



R. Moxon / Maiden et aL, 1998 
R. Moxon 



References: 

Seiler A. etaL, Mol. Microbiol., 1996, 19(4):841-856. 
Maiden et aL, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090_519 <SEQ ID 3185> 

MEFFI I LLAAVA V FG FK S FW I PQQEVHVVERLGRFHRALTAGLN ILIPFI DRVAYRHSL 

KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DS SKTAK * 
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Z2491_519 <SEQ ID 3186> 

MEFFI ILLAAVVVFGFKSFVVI PQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMK IIDSS KTAK * 

ZV01_519 <SEQ ID 3187> 

ME F F 1 1 L LVAVAV FG FK S FVV I PQQE VH WE RLGR FHRALT AG LN I L I P F I DRVA YRH S L 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE IN ST WAALDEAAGAWGVKVLRYE IKDLVP PQE I LRSMQAQI TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV02_519 <SEQ ID 3188> 

MEFFI ILLVAVAVFGFKSFWI PQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMK IIDSS KTAK * 

ZV03_519 <SEQ ID 3189> 

MEFFIIL LVAVAVFG FKS FW I PQQE VHWERLGR FHRALT AGLN I L I P FI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEER DE I N ST WSALDEAAGAWGVKVLR YE IKDLVP PQE I LRSMQAQI TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV04_519 <SEQ ID 3190> 

MEFFIILLVAVAVFG FKS FW I PQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE IN ST WSALDEAAGAWGVKVLR YE IKDLVP PQE I LRSMQAQI TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV05_519 <SEQ ID 3191> 

MEFFI ILLVAVAVFGFKSFWI PQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMK IIDSS KTAK* 

ZV06_519ASS <SEQ ID 3192> 

MEFFI I LLVAVAVFG FKS FWI PQQE VH WERLGRFHRALT AGLN I L I P FI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE IN STVFSALDEAAGAWGVKVLRYE IKDLVP PQE I LRSMQAQITAERK 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMK IIDSS KTAK * 

ZV07_519 <SEQ ID 3193> 

ME FF 1 1 L LVAVAVFG FK S FW I PQQE VH WERLG R FHRALT AG LN I L I P FI DRVA YRH S L 

KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

RMELDKT FEERDE INST WAALDEAAGAWGVKVLRYE IKDLVP PQE I LRSMQAQI TAERE 

KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMK IIDSS KTAK * 
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ZV11_519 <SEQ ID 3194> 

MEFFI ILLAAVAVFGFKSFVVIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKI I DSSKTAK* 

ZV12_519 <SEQ ID 3195> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV18_519 <SEQ ID 3196> 

ME FFI I LLVAVAVFGFKS FW I PQQE VH WERLGRFHRALT AGLN I L I P FI DRVAYRH S L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMK IIDSS KTAK * 

ZV19_519 <SEQ ID 3197> 

ME FFI I LLVAVAVFGFKS FW I PQQEVHVVERLGRFHRALTAGLNI LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMK IIDSS KTAK * 

ZV20_519ASS <SEQ ID 3198> 

MEFFI I LLVAVAVFGFKS FWIPQQEVHWERLGRFHRALTAGLNIL I PFI DRV AYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSM 
I SAGMK 1 1 DS S KTAK* 

ZV21_519ASS <SEQ ID 3199> 

MEFFI ILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMK IIDSS KTAK * 

ZV22_519ASS <SEQ ID 3200> 

MEFFIILLAAVWFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAKIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV26_519 <SEQ ID 3201> 

ME FFI I LLAAVWFG FKS FWI PQQEVH WERLGRFHRALT AGLN I L I PFI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV27 519 <SEQ ID 3202> 
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ME FFI I LLVAVAVFG FKS FVV I PQQE VHVVERLGR FHRALT AGLN I LI P FI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYII^AITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK * 

2V28_519 <SEQ ID 3203> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV29_519ASS <SEQ ID 3204> 

ME FFI I LLAAVAVFG FKS FW I PQQE VH WERLGR FHRALT AG LNILIPFI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSIWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREPEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI IDSNKTAK* 

ZV32_519 <SEQ ID 3205> 

MEFFIILIAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RME LDKT FEERDE I N STWSAL DE AAGAWGVKVLR YE I KD LV P PQE I LRAMQAQI T AERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV96519 <SEQ ID 3206> 

MEFFI I LLAAVAVFG FKS FWI PQQEVHWERLGR FHRALT AG LN I LI PFI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RME LDKT FEERDE INST WAALDE AAGAWGVKVLR YE I KDLVP PQE I LRSMQAQI T AERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DS S KT AK * 

Figure 22 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 519, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 15 



Table 6 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 919 among different strains. 
Table 6 

919 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 
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number 








Gr upB 






zmOl 


NG6/88 


R. Moxon / Seiler et al, 1 


1996 


zm02 


BZ198 


R. Moxon / Seiler et al., ' 


L996 


zm03 


NG3/88 


R. Moxon / Seiler et al., 


L996 


zm04 


297-0 


R. Moxon / Seiler et al., 


L996 


zm05 


1000 


R. Moxon / Seiler et al., 


1996 


zm06 


BZ147 


R. Moxon / Seiler et al, 


L996 


zm07 


BZ169 


R. Moxon / Seiler et al, 


L996 


zm08n 


528 


R. Moxon / Seiler et al, 


1996 


zm09 


NGP165 


R. Moxon / Seiler et al., 


1996 


zmlO 


BZ133 


R. Moxon / Seiler et al., 


1996 


zmllasbc NGE31 


R. Moxon / Seiler et al., 


1996 


zml2 


NGF26 


R. Moxon / Seiler et al., 


1996 


zml3 


NGE28 


R. Moxon / Seiler et al., 


1996 


zml4 


NGH38 


R. Moxon / Seiler et al., 


1996 


15 


SWZ107 


jx. ivioxon / oeiier ei ui., 




zml6 


NGH15 


R. Moxon / Seiler et al., 


1996 


zml7 


NGH36 


R. Moxon / Seiler et al, 


1996 


zml8 


BZ232 


R. Moxon / Seiler et al., 


1996 


zml9 


BZ83 


R. Moxon / Seiler et al, 


1996 


zm20 


44/76 


R. Moxon / Seiler et aL 9 


1996 


zm21 


MC58 


R. Moxon 




zm96 


2996 


Our collection 






Group A 






zm22 


205900 


R. Moxon 




zm23asbc F6124 


R. Moxon 




z2491 


Z2491 


R. Moxon / Maiden et al., 




Group C 






zm24 


90/18311 


R. Moxon 




zm25 


93/4286 


R. Moxon 





Others 

zm26 A22 (group W) R. Moxon / Maiden et al, 1998 
zm27bc E26 (group X) R. Moxon / Maiden et al, 1 998 
zm28 860800 (group Y) R. Moxon / Maiden et al, 1998 
zm29asbc E32 (group Z) R. Moxon / Maiden et al, 1998 
zm31asbc N. lactamica R. Moxon 

Gonococcus 

zm32asbc Ng F62 R. Moxon / Maiden et al, 1998 

zm33asbc Ng SN4 R. Moxon 



fal090 FA1090 R. Moxon 
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References: 

Seiler A. et aL, Mol. Microbiol., 1996, 19(4):841-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3207> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTT G YVWQLL PNGMKPE YRP * 

22491 <SEQ ID 3208> 

MKKYL FRAALCG I AAAI LAACQSKS I QT FPQPDT SVINGPDRPVG I P DPAGTT VGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTT G YVWQLL PNGMKPE YRP* 

ZM01 <SEQ ID 3209> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLS LPHWAAQDFAKS LQS FRLGCANLKNRQGWQDVCAQAFQT PVHS FQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLL PNGMKPE YRP* 

ZM02 <SEQ ID 3210> 

MKKYLFRAALYGIAAAI LAACQSKS IQTFPQPDTSVINGPDRPVG I PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADPCNEHPYVSIGRYMADKGYL 
KLGQT SMQG I K S YMRQN PQRLAE VLGQN P S Y I F FRE LAG S SN DG P VG ALGT PLMGE Y AG A 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM03 <SEQ ID 3211> 

MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVG I PDPAGTTVGGGGAV 
YTWPHLS LPHWAAQDFAKS LQS FRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG IKS YMRQN PQRLAE VLGQN PS Y I FFRELAGSSNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLL PNGMKPE YRP * 



ZM04 <SEQ ID 3212> 
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MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM05 <SEQ ID 3213> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLSCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM06 <SEQ ID 3214> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFP I YG I PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LG YAE DPVELFFMHI QGS GRLKT PSGKYIRIGYADKNEHPYVS IGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM07 <SEQ ID 3215> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM08N <SEQ ID 3216> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAP I LGYAEDPVELFFMHI QGS GRLKT PSGKYIRIGYADKNEHPYVS I GRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM09 <SEQ ID 3217> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LG YAE DPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRH Y I T LGAPLFVAT AH PVTRKALNRLIMAQDTGS AI KGAVRVD Y FWG YG DE AGE LAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM10 <SEQ ID 3218> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LGYAEDPVELFFMHI QGS GRLKT PSGKYIRIGYADKNEHPYVS IGKYMADKGYL 
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KLGQTSMQG I KS YMRQNPQRLAE VLGQN PS Y I FFRELTGS GNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM11ASBC <SEQ ID 321 9> 

c MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSVQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFP I YG I PDDFI SVPLPAGLRSGKA 

t v LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYI> 
KLGQTSMQG IKS YMRQNPQRLAE VLGQN PS Y I FFRELTGS SNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM12 <SEQ ID 3220> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYI FFRE LAGS SNDGPVGALGT PLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM13 <SEQ ID 3221> 

MKKYLFRAAL YG I AAAI LAACQS KS I QT FPQPDT S VING P DRPVG I P DPAGTTVGGGGAV 
YTWPHLSLPHWAEQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGY YE PVLKGDDRRTAQARFP I YG I PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM14 <SEQ ID 3222> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFP I YG I PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSRNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM15 <SEQ ID 3223> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDLAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNHQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM16 <SEQ ID 3224> 

* MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPGRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKS LQS FRLG CAN LKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 

{ LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQG I KSYMRQNPQRLAEVLGQN PS YI FFRELTGS SNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



ZM17 <SEQ ID 3225> 
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MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHFYVSIGKYMADKGYL 
KLGQTSMQGIKS YMRQNPQRLAEVLGQNPSYI FFRELTGS SNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM18 <SEQ ID 3226> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG IKS YMRQNPQRLAEVLGQNPSYIFFRE LAGS SNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

2M19 <SEQ ID 3227> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG I KSYMRQNPQRLAEVLGQNPSYIFFRELAGS SNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM20 <SEQ ID 3228> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM21 <SEQ ID 3229> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM22 <SEQ ID 3230> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTG YVWQLL PNGMK PE YRP * 

ZM23ASBC <SEQ ID 3231> 

MKKYLFRAALYGIAAAI LAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
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KLGQTSMQGIKSYMRQN PQRLAEVLGQNPSYIFFRELAGSSNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTSKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKE PGY VWQLL PNGMKPE YR P * 

ZM24 <SEQ ID 3232> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQT SMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLL PNGMKPE YRP * 

ZM25 <SEQ ID 3233> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQT SMQG IKS YMRQNPQRLAEVLGQN PS YI FFRELTGSGNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLL PNGMKPE YRP* 

ZM26 <SEQ ID 3234> 

MKKYLFRAALYG I AAAI LAACQSK S I QT FPQPDT S VING P DRPVG I PD P AGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQT SMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM27BC <SEQ ID 3235> 

MKKYLFRAALYGISAAILAACQSKSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGI KS YMRQNPQRLAEVLGQN PS Y I FFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKE PGYVWQLL PNGMKPE YRP* 

ZM28 <SEQ ID 3236> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQT SMQG I KAYMRQN PQRLAE VLGQN PS Y I FFRELAGS SN DGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM29ASBC <SEQ ID 3237> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQS FRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQT SMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATTHPITRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPE YR P * 



ZM31ASBC <SEQ ID 3238> 
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MKKHLFRAALYGIAAAILAACQSKSIQTFPQPDTSIIKGPDRPAGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDFVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYVFFRELAGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM32ASBC <SEQ ID 3239> 

MKKHLLRSALYGIAAAILAACQSRSIQT FPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTVVPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LG YAE DPVE L FFMH IQGSGRLKT PSGKY IR IG YADKNEH PYVS I GR YMADKG YL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGGDGPVGALGTPLMGGYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM33ASBC <SEQ ID 3240> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YT WPHLSMPHWAAQD FAKS LQS FRLGCANLKNRQGWQDVC AQ AFQT P I HS FQAKRFFER 
Y FT P WQV AGNG S L AGT VTG Y YE P VLKG DGRRT E RARF P IYGIPDDFISVPL PAG LRG G KN 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPHKLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

2M96 <SEQ ID 3241> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 



Figure 23 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 919, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 16 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 7: Oligonucleotides used for PCR to amplify complete or partial ORFs 
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ORF 


primer Sequence 


Restriction 
sites 


001 


Forward 


CGCGGATCCCATATG-TGGATGGTGCTGGTCAT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TGCCGTCTTGTCCCAC 


Xhol 


003 


Forward 


CGCGGATCCCATATG-GTCGTATTCGTGGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAAATCATGAACACGCGC 


Xhol 


005 


Forward 


CGCGGATCCCATATG-GACAATATTGACATGT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CATCACATCCGCCCG 


Xhol 


006 


Forward 


CGCGGATCCCATATG-CTGCTGGTGCTGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGTTCCGGCTTTGATGT 


Xhol 


007 


Forward 


CGCGGATCCCATATG-GCCGACAACAGCATCAT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAGGCGTTCATGATATAAG 


Xhol 


008 


Forward 


CGCGGATCCCATATG-AACAACAGACATTTTG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CCTGTCCGGTAAAAGAC 


Xhol 


009 


Forward 


CGCGGATCCCATATG-CCCCGCGCTGCT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TGGCTTTTGCCACGTTTT 


Xhol 


Oil 


Forward 


CGCGGATCCCATATG-AAGACACACCGCAAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GGCGGTCAGTACGGT 


Xhol 


012 


Forward 


CGCGGATCCCATATG-CTCGCCCGTTGCC 


BamHI- 








Ndel 




Reverse 




Xhol 


013 


Forward 


/"-t/-^/~i/~'/~* a TOOO ATA TO OO' F'T 'TO A OO A TOOT 

CCjCCjOA 1 CCC A 1 A 1 o-CC 111 LrACC A 1 vjC 1 


Bamnl- 








JNflei 




Jxcvcrsc 


CCCClCTCa AO-PTO ATTPGnr A A A A A A ATCT 




m a 


Forward 


r^r^c^c* a tooo ata to o a oo a o a ooo a ott 


JJdillJJ.1 








Ndel 




Reverse 


CCCGCTCGAG-GACGAGGCGAACGCC 


XhoT 




Forward 


AAAGAATTC-CTGCCAGCCGGCAAGACCCCGGC 


Frr> 

U/L/L/ XVI 




Reverse 


AAACTGCAG-TCAGCGGGCGGGGACAATGCCCAT 


PstI 


023 


Forward 


AAAGAATTC-AAAGAATATTCGGCATGGCAGGC 


EcoRI 




Reverse 


AAACTGCAG-TTACCCCCAAATCACTTTAACTGA 


PstI 


025 


Forward 


AAAGAATTC-TGCGCCACCCAACAGCCTGCTCC 


EcoRI 




Reverse 


AAACTGCAG-TCAGAACGCGATATAGCTGTTCGG 


PstI 


031 


Forward 


CGCGGATCCCATATG-GTCTCCCTTCGCTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATGTAAGACGGGGACAAC 


Xhol 


032 


Forward 


CGCGGATCCCATATG-CGGCGAAACGTGC 


BamHI- 
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Ndel 




IVt" V V-I 


CCCGCTCGAG-CTGGTTTTTTGATATTTGTG 

X ' Vv V_> V 1 V_*> X V-' VJ J. JV VJ V*' X VJ VJ J. X X X X J, VJ J X X -4 V XXX V» X VJ 


Xhol 


V/«J-) 


17 r\T\xf arrl 

X VJl W OJ VI 


CGCGGATCCCATATG-GCGGCGGCAGACA 














XvC/ V t/I 


CPCGCTCGAG-ATTTGCCGCATCCCGAT 


XhoT 




"PrtflX/ El f H 


PGPGGATPPPATATG-GPPGAAAAPAGPTAPGG 


RamHT- 

XJdll 11 11 












IxCVCloC 


PPPGCTCGAG-TTTGACGATTTGGTTPAATT 

V/V'V/vJV-' X V> VJ rVVJ X x X VJiVV/ VJ/V X X X VJVJ X X V^/kil X X 


XhoT 




ir orwdru 


pnrnnATrrr atatg-ptg a agppgtgpg 

V^VJ V^ VJ VJ.r\ X v^v^V^/A, X X VJ O X vj/A-rt VJ V_^v^ VJ 1 vJV- ' vJ 


JJOJIlXXl 








lNuei 






PPPGPTPGAG-PPGGAPTGPGTATPGG 


XhoT 


Loo 


r orwaiu 


rnrnn a Trrr ata tpu a ccn a tttppp^pp a 

v^ VJJV^vJVJi^V 1 VxV^ V^^rV X r\ 1 U r\V/\_. VJA 111 v^V^VJV^V_/r\ 


"Rc*rr*t4T 
J3aITxxll- 








JNaei 




jxeverse 


PPPnrTr 1 G A G-TTPT A PGPPGT A PTGPP 

V-VvV/VjVy 1 V^ VJxAvJ~ X X 1 ^Vv VJ V^V_^ VJ x rv\^ X vJV-'Vv 


VhnT 




_r orwarci 


rnrnn atpppatatg.ppotppga appi^p 

V^ VJ VJ Vj/v X V^ V^ v_//Y X rV X U v> vVJ X V^ V^ VJj?A_/AV^ V^ V_J 


odiixrii- 








Ndel 




Reverse 


cccgctcgag-taggatgacgaggtagg 


Xhol 


041 


Forward 


cgcggatcccatatg-ttcgtgcgcgaaccgc 


BamHI- 








Ndel 




Reverse 


cccgctcgag-gcccaaaaactctttcaaa 


Xhol 


OA? 


1/ KJl W CU vA 


PGCGGATCCCATATG-ACGATGATTTGCTTGC 

\_s VJ VJ VJajW X VvvX-zi a. X .£ JL X VJ ij.V'Vj/ 1 X VJ -X X XXX VJ V ' X X VJ V/ 


BamHT- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGCAGCCTGCATTTGAC 


Xhol 


043 


Forward 


AAAAAAGGTACC-ATGGTTGTTTCAAATCAAAATATC 


Kpnl 




Reverse 


AAACTGCAG-TTATTGCGCTTCACCTTCCGCCGC 


Pst I 




J. KJl W CLL \X 


AAAAAAGGTACC-GCAAAAGTGCATGGCGGCTTGGACGGTGC 

J F J Jul »_t»-£ JL VJ V— J X X 1V>V V — * V-"-* X^ X-l 1.VJ X v P V* X X VJ V J \J VJ V_-" X X VJ VJi 1\^VI VJ X VJ V-/ 


Kpn I 




XX.CVC1 oC 


AAA A AACTGC AG- 


Pst T 

X D I X 






TTAATCCTGCAACACGAATTCGCCCGTCCG 




044 


Forward 


CGCGGATCCCATATG-CCGTCCGACTAGAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATGCGCTACGGTAGCCA 


Xhol 


046 


Forward 


AAAGAATTC-ATGTCGGCAATGCTCCCGACAAG 


Eco RI 




Reverse 


AAACTGCAG-TCACTCGGCGACCCACACCGTGAA 


Pst I 


047 


Forward 


CGCGGATCCCATATG-GTCATCATACAGGCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCCGAAAAAGCCCATTTTG 


Xhol 


048 


Forward 


AAAGAATTC-ATGCTCAACAAAGGCGAAGAATTGCC 


Eco RI 




Reverse 


AAACTGCAG-TCAAGATTCGACGGGGATGATGCC 


Pst I 


049 


Forward 


AAAGAATTC-ATGCGGGCGCAGGCGTTTGATCAGCC 


Eco RJ 




Reverse 


AAACTGCAG-AAGGCGTATCTGAAAAAATGGCAG 


Pst I 


050 


Forward 


CGCGGATCCCATATG-GGCGCGGGCTGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AATCGGGCCATCTTCGA 


Xhol 


052 


Forward 


AAAAAAGAATTC-ATGGCTTTGGTGGCGGAGGAAAC 


EcoRI 




Reverse 


AAAAAAGTCGAC-TCAGGCGGCGTTTTTCACCTTCCT 


Sail 


052a 


Forward 


AAAAAAGAATTC-GTGGCGGAGGAAACGGAAATATCCGC 


Eco RI 
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Reverse AAAAAACTGCAG-TTAGCTGTTTTTGGAAACGCCGTCCAACCC Pst I 



073 Forward CGC GGATCCCATATG -TGTATGCCATATAAGAT BamHI- 

Ndel 

Reverse CCC GCTCGAG -CACCGGATTGTCCGAC Xhol 

075 Forward CGC GGATCCCATATG -CCGTCTTACTTCATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCACCAATGCCGATTATTT Xhol 

077a Forward AAAAAA GAATTC -GGCGGCATTTTCATCGACACCTTCCT EcoRI 

Reverse AAAAAA CTGCAG -TCAGACGAACATCTGCACAAACGCAAT Pst I 

080 Forward AAA GAATTC -GCGTCCGGGCTGGTTTGGTTTTACAATTC EcoRI 
Reverse AAA CTGCAG -CTATTCTTCGGATTCTTTTTCGGG Pst I 

081 Forward AAA GAATTC -ATGAAACCACTGGACCTAAATTTCATCTG Eco RI 
Reverse AA ACTGCAG -TCACTTATCCTCCAATGCCTC Pst I 

082 Forward AA AGAATTC -ATGTGGTTGTTGAAGTTGCCTGC Eco RI 
Reverse AAA CTGCAG -TTACGCGGATTCGGCAGTTGG Pst I 

084 Forward AAA GAATTC -TATCACCCAGAATATGAATACGGCTACCG Eco RI 
Reverse AA ACTGCAG -TTATACTTGGGCGCAACATGA Pst I 

085 Forward CGC GGATCCCATATG -GGTAAAGGGCAGGACT BamHI- 

Ndel 

Reverse CCC GCTCGAG -CAAAGCCTTAAACGCTTCG Xhol 

086 Forward AAAAA AGGTACC -TATTTGGCATCAAAAGAAGGCGG Kpnl 
Reverse AAA CTGCAG -TTACTCCACCCGATAACCGCG Pst I 

087 Forward AAA GAATTC -ATGGGCGGTAAAACCTTTATGC Eco RI 
Reverse AA ACTGCAG -TTACGCCGCACACGCAATCGC Pst I 

087a Forward AAAAA AGAATTC -AAGCTATTAGGCGTGCCGATTGTGATTCA Eco RI 

Reverse AAAAAA CTGCAG -TTACGCCTGCAAGATGCCCAGCTTGCC Pst I 

088 Forward AAAAA AGAATTC -ATGTTTTTATGGCTCGCACATTTCAG Eco RI 
Reverse AAAAAA CTGCAG -TCAGCGGATTTTGAGGGTACTCAAACC Pst I 

089 Forward CGC GGATCCCATATG -CCGCCCAAAATCAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TGCGCATACCAAAGCCA Xhol 

090 Forward CGC GGATCCCATATG -CGCATAGTCGAGCA BamHI- 

Ndel 

Reverse CCC GCTCGAG -AGCAAAACGGCGGTACG Xhol 

091 Forward AA AGAATTC -ATGGAAATACCCGTACCGCCGAGTCC Eco RI 
Reverse AAA CTGCAG- TCAGCGCAGGGGGTAGCCCAAGCC Pst I 

092 Forward AA AGAATTC -ATGTTTTTTATTTCAATCCG Eco RI 
Reverse AA ACTGCAG -TCAAATCTGTTTCGACAATGC Pst I 

093 Forward AA AGAATTC -ATGCAGAATTTTGGCAAAGTGGC Eco RI 
Reverse AA ACTGCAG -CTATGGCTCGTCATACCGGGC Pst I 

094 Forward AA AGAATTC -ATGCCGTCACGGAAGCGCATCAACTC Eco RI 
Reverse AAA CTGCAG- TTATCCCGGCCATACCGCCGAACA Pst I 

095 Forward AA AGAATTC- ATGTCCTTTCATTTGAACATGGACGG Eco RI 
Reverse AA ACTGCAG -TCAACGCCGCAGGCACTAACGCCC Pst I 

096 Forward AAAGAATTC-ATGGCTCGTCATACCGGGCAGGG Eco RI 
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Reverse AA ACTGCAG -TCAAAGGAAAAGGCCGTCTGAAAAGCG Pst I 

097 Forward AAA GAATTC -ATGGACACTTCAAAACAAACACTGTTG Eco RI 
Reverse AA ACTGCAG -TCAGCCCAAATACCAGAATTTCAG Pst I 

098 Forward AA AGAATTC -GATGAACGCAGCCCAGCATGGATACG Eco RI 
Reverse AA ACTGCAG -TTACGACATTCTGATTTGGCA Pst I 

102 Forward AAAAA AGAATTC -GGCCTGATGATTTTGGAAGTCAACAC Eco RI 

Reverse AAAAA ACTGCAG -TTATCCTTTAAATACGGGGACGAGTTC Pst I 
105 Forward CG CGGATCCCATATG -TCCGCAAACGAATACG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GTGTTCTGCCAGTTTCAG Xhol 

107 Forward AAAAA AGAATTC - Eco RI 

CTGATGATTTTGGAAGTCAACACCCATTATCC 
Reverse AAAAAA CTGCAG -TTATCCTTTAAATACGGGGACGAGTTC Pst I 
107b Forward AAAAA AGAATTC - Eco RI 

GATACCCAAGCCCCCGCCGGCACAAACTACTG 
Reverse AAAAA ACTGCAG - Pst I 

TTACGCGTCGCCTTTAAAGTATTTGAGCAGGCTGGAGAC 

108 Forward AA AGAATTC -ATGTTGCCGGGCTTCAACCG Eco RI 
Reverse AA ACTGCAG -TTAGCGGTACAGGTGTTTGAAGCA Pst I 

108a Forward AAAAA AGAATTC -GGTAACACATTCGGCAGCTTAGACGGTGG Eco RI 
Reverse AA ACTGCAG -TTAGCGGTACAGGTGTTTGAAGCA Pst I 

109 Forward AA AGAATTC -ATGTATTATCGCCGGGTTATGGG EcoRI 
Reverse AA ACTGCAG -CTAGCCCAAAGATTTGAAGTGTTC Pst I 

111 Forward CG CGGATCCCATATG -TGTTCGGAACAAACCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGGAGCAGTTTTTCAAA Xhol 
114 Forward CG CGGATCCCATATG -GCTTCCATCACTTCGC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CATCCGCGAAATCGTC Xhol 

117 Forward AAAAA AGGTACC -ATGGTCGAAGAACTGGAACTGCTG Kpn I 
Reverse AA ACTGCAG -TTAAAGCCGGGTAACGCTCAATAC Pst I 

118 Forward AA AGTCGAC ATGTGTGAGTTCAAGGATATTATAAG Sail 
Reverse AA AGCATGC -CTATTTTTTGTTGTAATAATCAAATC Sph I 

121 Forward CGC GGATCCCATATG -GAAACACAGCTTTACAT BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATAATAATATCCCGCGCCC Xhol 

122 Forward CGC GGATCCCATATG -GTCATGATTAAAATCCGCA BamHI- 

Ndel 

Reverse CCCG CTCGAG -AATCTTGGTAGATTGGATTT Xhol 

125 Forward AA AGAATTC -ATGTCGGGCAATGCCTCCTCTCC Eco RI 
Reverse AAA CTGCAG -TCACGCCGTTTCAAGACG Pst I 

125a Forward AAAAA AGAATTC -ACGGCAGGCAGCACCGCCGCACAGGTTTC Eco RI 
Reverse AAAAA ACTGCAG - Pst I 

TTATTTTGCCACGTCGGTTTCTCCGGTGAACAACGC 

126 Forward CGCGGATCCCATATG-CCGTCTGAAACCC BamHI- 
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Ndel 

Reverse CCCG CTCGAG -ATATTCCGCCGAATGCC Xhol 

127 Forward AA AGAATTC -ATGGAAATATGGAATATGTTGGACACTTG Eco RI 
Reverse AA ACTGCAG -TTAAAGTGTTTCGGAGCCGGC Pst I 

127a Forward AAAAAA GAATTC -AAGGAACTGATTATGTGTCTGTCGGG Eco RI 

Reverse AA ACTGCAG -TTAAAGTGTTTCGGAGCCGGC Pst I 

128 Forward CGC GGATCCCATATG -ACTGACAACGCACT BamHI- 

Ndel 

Reverse CCCG CTCGAG -GACCGCGTTGTCGAAA Xhol 

130 Forward CG CGGATCCCATATG -AAACAACTCCGCGA BamHI- 

Ndel 

Reverse CCCG CTCGAG -GAATTTTGCACCGGATTG Xhol 

132 Forward AA AGAATTC -ATGGAACCCTTCAAAACCTTAATTTG Eco RI 

Reverse AAAAAA CTGCAG -TCACCATGTCGGCATTTGAAAAAC Pst I 

134 Forward CGC GGATCCCATATG -TCCCAAGAAATCCTC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CAGTTTGACCGAATGTTC Xhol 

135 Forward CGC GGATCCCATATG -AAATACAAAAGAATCGTATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAATTCGGTCAGAAGCAGG Xhol 

137 Forward AAAAAA GGTACC -ATGATTACCCATCCCCAATTCGATCC Kpn I 
Reverse AAAAA ACTGCAG -TCAGTGCTGTTTTTTCATGCCGAA Pst I 

137a Forward AAAAAA GAATTC -GGCCGCAAACACGGCATCGGCTTCCT Eco RI 

Reverse AAAAA ACTGCAG -TTAAGCGGGATGACGCGGCAGCATACC Pst I 

138 Forward AAAAAA GAATTC -AACTCAGGCGAAGGAGTGCTTGTGGC EcoRI 
Reverse AAAAAA TCTAGA- TCAGTTTAGGGATAGCAGGCGTAC Xbal 

141 Forward AAA GAATTC -ATGAGCTTCAAAACCGATGCCGAAATCGC Eco RI 
Reverse AA ACTGCAG -TCAGAACAAGCCGTGAATCACGCC Pst I 

142 Forward CGC GGATCCCATATG -CGTGCCGATTTCATG BamHI- 

Ndel 

Reverse CCC GCTCGAG -AAACTGCTGCACATGGG Xhol 

143 Forward AAAAA AGAATTC- Eco RI 

ATGCTCAGTTTCGGCTTTCTCGGCGTTCAGAC 

Reverse AAAAA ACTGCAG -TCAAACCCCGCCGTGTGTTTCTTTAAT Pst I 

144 Forward AAAAA AGAATTC -GGTCTGATCGACGGGCGTGCCGTAAC Eco RI 
Reverse AAAAA ATCTAGA -TCGGCATCGGCCGGCATATGTCCG Xbal 

146 Forward AAAAA AGAATTC - Eco RI 

CGCCAAGTCGTCATTGACCACGACAAAGTC 

Reverse AAAAAA CTGCAG -TTAGGCATCGGCAAATAGGAAACTGGG Pst I 

147 Forward AAAAAAGAATTC- ACTGAGCAATCGGTGGATTTGGAAAC Eco RI 
Reverse AAAAAA TCTAGA -TTAGGTAAAGCTGCGGCCCATTTGCGG Xbal 

148 Forward AAAAAA GAATTC - Eco RI 

ATGGCGTTAAAAACATCAAACTTGGAACACGC 

Reverse AAAAAA TCTAGA -TCAGCCCTTCATACAGCCTTCGTTTTG Xbal 

149 Forward CGCGGATCCCATATG-CTGCTTGACAACAAAGT BamHI- 
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150 



Reverse CCCG CTCGAG -AAACTTCACGTTCACGCC 
Forward CGCGGATCCCATATG-CAGAACACAAATCCG 



Reverse 

151 Forward 

Reverse 

152 Forward 

Reverse 

153 Forward 
Reverse 

153a Forward 
Reverse 



CCCG CTCGAG -ATAAACATCACGCTGATAGC 
AAAAAA GAATTC - 

ATGAAACAAATCCGCAACATCGCCATCATCGC 

AAAAA ACTGCAG -TCAATCCAGCTTTTTAAAGTGGCGGCG 

AAAAA AGAATTC - 

ATGAAAAACAAAACCAAAGTCTGGGACCTCCC 

AAAAAA CTGCAG -TCAGGACAGGAGCAGGATGGCGGC 

AAAAAA GAATTC -ATGGCGTTTGCTTACGGTATGAC 

AAAAA ACTGCAG -TCAGTCATGTTTTTCCGTTTCATT 

AAAAAA GAATTC -CGGACTTCGGTATCGGTTCCCCAGCATTG 

AAAAAACTGCAG- 



TTACGCCGACGAAATACTCAGACTTTTCGG 

154 Forward CGC GGATCCCATATG -ACTGACAACAGCCC 

Reverse CCCG CTCGAG -TCGGCTTCCTTTCGGG 

155 Forward AAAAAA GAATTC -ATGAAAATCGGTATCCCACGCGAGTC 
Reverse AAAAA ACTGCAG -TTACCCTTTCTTAAACATATTCAGCAT 

156 Forward AAAAAA GAATTC -GCACAGCAAAACGGTTTTGAAGC 
Reverse AAAAAA CTGCAG -TCAAGCAGCCGCGACAAACAGCCC 

157 Forward CGC GGATCCCATATG -AGGAACGAGGAAAAAC 

Reverse CCCG CTCGAG -AAAACACAATATCCCCGC 

158 Forward AAAAA AGAATTC -GCGGAGCAGTTGGCGATGGCAAATTCTGC 
Reverse AAAAA ATCTAGA -TTATCCACAGAGATTGTTTCCCAGTTC 

160 Forward CGC GGATCCCATATG -GACATTCTGGACAAAC 

Reverse CCCG CTCGAG- TTTTTGCCCGCCTTCTTT 

163 Forward AAAAA AGGTACC -ACCGTGCCGGATCAGGTGCAGATGTG 
Reverse AAAAA ATCTAGA -TTACTCTGCCAATTCCACCTGCTCGTG 

163a Forward AAAAAA GAATTC -CGGCTGGTGCAGATAATGAGCCAGAC 
Reverse AAAAA ATCTAGA- TTACTCTGCCAATTCCACCTGCTCGTG 

164 Forward CGC GGATCCCATATG -AACCGGACTTATGCC 

Reverse CCCG CTCGAG -TTTGTTTCCGTCAAACTGC 

165 Forward CGC GGATCCGCTAGC- GCTGAAGCGACAGACG 

Reverse CCCG CTCGAG -AATATCCAATACTTTCGCG 
206 Forward CGC GGATCCCATATG -AAACACCGCCAACCGA 

Reverse CCC GCTCGAG -TTCTGTAAAAAAAGTATGTGC 
209 Forward CGCGGATCCCATATG-CTGCGGCATTTAGGA 



Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 
EcoRI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

Xbal 

BamHI- 

Ndel 

Xhol 

Kpnl 

Xbal 

EcoRI 

Xbal 

BamHI- 

Ndel 

Xhol 

BaroHI- 

Nhel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 
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Reverse CCCG CTCGAG -TACCCCTGAAGGCAAC 

211 Forward AAAAAAGAATTC-ATGTTGCGGGTTGCTGCTGC 
Reverse AAAAA ACTGC AG -CT ATCCTGCGG ATTGGC ATTG A A A 

212 Forward CG CGGATCCCATATG -GACAATCTCGTATGG 

Reverse CCCGCTCGAG-AGGGGTTAGATCCTTCC 

215 Forward CGC GGATCCCATATG -GCATGGTTGGGTCGT 

Reverse CCCGCTCGAG-CATATCTTTTGTATCATAAATC 

216 Forward CG CGG ATCCC AT ATG -GC AATGGC AG AAA A CG 

Reverse CCC GCTCGAG -TACAATCCGTGCCGCC 

217 Forward CGC GGATCCCATATG -GCGGATGACGGTGTG 

Reverse CCCGCTCGAG-ACCCCGAATATCGAATCC 

218 Forward CG CGGATCCCATATG -GTCGCGGTCGATC 

Reverse CCC GCTCGAG -TAACTCATAGAATCCTGC 

219 Forward CGC GGATCCGCTAGC -ACGGCAAGGTTAAG 

Reverse CCCG CTCGAG -TTT AAACC ATCTCCTC A A A A C 
223 Forward CGC GGATCCCATATG -GAATTCAGGCACCAAGTA 

Reverse CCC GCTCGAG -GGCTTCCCGCGTGTC 

225 Forward CGC GGATCCCATATG -GACGAGTTGACCAACC 

Reverse CCC GCTCGAG -GTTCAGAAAGCGGGAC 

226 Forward AAAGAATTC-CTTGCGATTATCGTGCGCACGCG 
Reverse AA ACTGCAG -TCAAAATCCCAAAACGGGGAT 

228 Forward CGC GGATCCCATATG -TCGCAAGAAGCCAAACAG 

Reverse CCCG CTCGAG -TTTGGCGGCATCTTTCAT 

229 Forward CGC GGATCCCATATG -CAAGAGGTTTTGCCC 

Reverse CCCGCTCGAG-ACACAATATAGCGGATGAAC 

230 Forward CGCGGATCCCATATG-CATCCGGGTGCCGAC 



Reverse CCCGCTCGAG-AAGTTTGGCGGCTTCGG 
Forward AAAAAA GAATTC -ATGTACGCTAAAAAAGGCGGTTTGGG 
Reverse AAAAAACTGCAG-TCAAGGTTTTTTCCTGATTGCCGCCGC 
232a Forward AAAAA AGAATTC -GCC AAGGCTGCCGATAC AC A A A TTG A 
Reverse AAAAAACTGCAG-TTAAACATTGTCGTTGCCGCCCAGATG 
Forward CGC GGATCCCATATG -GCGGACAAACCC A AG 



232 



233 



Reverse CCCGCTCGAG-GACGGCATTGAGCAG 
234 Forward CGC GGATCCC AT ATG -GCCGTTTCACTG A CGG 



Xhol 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Nhel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 
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JNael 




iveverse 


nrrr a a hptt- a pprrrTnr; a tt^pp a th 

UV/LLAnUUl 1 -/W^VJVJ I luuAl 


TT • J TTT 

rima ill 


o r" 

235 


Forward 


CGCUGA 1 LttA 1 A 1 G-GCC 1 GCCAAGTTCAAA 


BamHI- 








Ndel 




xvevorse 


pppoptpo a o JTTTrw^r;pTnPTPTTP 

V^v-/V^\jrv^ 1 V-.VJ/'VVJ- 111 vJVJVJV' 1 VJV^ 


AilOl 


236 


Forward 


COCCjGATCLCATATG-GCGCGTTTCGCCTT 


BamHI- 








Ndel 




.Keverse 


V_.v^V^VJV^ 1 v^vj/\VJ-/\ 1 VJVJVJ 1 v^Ov^VJV^VJV^VvVJ 1 


Xnol 


238 


Forward 


CGCGGATCCGCTAGC-AACGGTTTGGATGCCCG 


BamHI- 








Nhel 




Reverse 


La^LAjL, 1 v^Vj/\vj- 1 1 1 vj i L> 1 AAvj 1 1 LL 1 VjA 1 A 1 vj 


Xhol 


239 


Forward 


CCGGAATTCTACATATG-CTCCACCATAAAGGTATTG 


EcoRI- 








Ndel 




Reverse 


LttuL 1 tuAu- 1 vjvj 1 uAAuAuLuu 111 AG 


Xhol 


240 


Forward 


CGCGGATCCCATATG-GACGTTGGACGATTTC 


BamHI- 








Ndel 




Reverse 


r~^{~ x r^r^r^ r TC*f~z \ rz a a a r^r^r^r* a tt a r^r^r^r^ a tv-* 
LLLuL i COALr- AAALuCL A 1 1 AGCCG A 1 G 


Xhol 


241 


Forward 


CCGGAATTCTACATATG-CCAACACGTCCAACT 


EcoRI- 








Ndel 




Reverse 


rrrnPTm a n Pi a a TPrnrrTPT a atta a tp 
L.L'L'VJV^ 1 v^vj/\vj-vj/v/\ 1 vjL.vjL'V^ lul /VA 1 1AA1L 


Xhol 


242 


Forward 


CGCGGATCCCATATG-ATCGGCAAACTTGTTG 


BamHI- 








Ndel 




Reverse 


pjppp a a pptt a ppn ata c^ncwcrir* A 

ULLvnAUL 1 1 -/\V^ V.VJ/Y 1 /YV-/VJVJ 1 L/VJL,/\VJ 


Hmdlll 


243 


Forward 


CGCGGATCCCATATG-ACGATTTTTTCGATGCTGC 


BamHI- 








Ndel 




Reverse 


v^LL^vjv^ 1 Lvj/Vvj-L>vj/\L' 1 luul IAL.CvjL.Vj 


VU ^ T 

Xhol 


244 


Forward 


CGCGGATCCCATATG-CCGTCTGAAGCCC 


BamHI- 








Ndel 




Reverse 


L-CL.VJL, 1 L/LrALj~ I I I 1 1 1 L-VjLt 1 ALjLjvjGA ill 


Xhol 


246 


Forward 


CGCGGATCCCATATG-GACATCGGCAGTGC 


BamHI- 








Ndel 




rceverse 


pppr^PTPr^APi r*r*c { (~ir*rzr % 'Trzr* f Tr r rz as^ 

V^LxV^vjI-/ 1 l^vj/Yvj-v^v^v^vjv^vjL^ 1 LrU 1 LjLjALt 


Xhol 


247 


Forward 


CGCGGATCCCATATG-GTCGGATCGAGTTAC 


BamHI- 








Ndel 




Reverse 


L^Uv^LrL, 1 L^LrALr-AALr i vj 1 ILIvjI 1 1 uLvjLA 


XT'! T 

Xhol 


248 


Forward 


CGCGGATCCCATATG-CGCAAACAGAACACT 


BamHI- 








Ndel 




Reverse 


CL-CLtL, 1 CLrA«a-C 1 LA 1 LA 1 1 A 1 1 CjC 1 AAUA 


Xhol 


249 


Forward 


CGCGGATCCCATATG-AAGAATAATGATTGCTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTCCCGACCTCCGAC 


Xhol 


251 


Forward 


CGCGGATCCCATATG-CGTGCTGCGGTAGT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TACGAAAGCCGGTCGTG 


Xhol 


253 


Forward 


AAAAAAGAATTC-ATGATTGACAGGAACCGTATGCTGCG 


EcoRI 




Reverse 


AAAAAACTGCAG-TTATTGGTCTTTCAAACGCCCTTCCTG 


PstI 
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253a Forward AAAAAAGAATTC-AAAATCCTTTTGAAAACAAGCGAAAACGG Eco RI 



Reverse AAAAA ACTGCAG -TTATTGGTCTTTCAAACGCCCTTCCTG Pst I 

254 Forward AAAAAA GAATTC -ATGTATACAGGCGAACGCTTCAATAC Eco RI 
Reverse AAAAA ATCTAGA -TCAGATTACGTAACCGTACACGCTGAC Xba I 

255 Forward CGC GGATCCCATATG -GCCGCGTTGCGTTAC BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATCCGCAATACCGACCAG Xhol 

256 Forward CGC GGATCCGCTAGC -TTTTAACACCGCCGGAC BamHI- 

Nhel 

Reverse CCC GCTCGAG -ACGCCTGTTTGTGCGG Xhol 

257 Forward CGC GGATCCCATATG -GCGGTTTCTTTCCTG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GCGCGTGAATATCGCG Xhol 

258 Forward AAAAAA GAATTC -GATTATTTCTGGTGGATTGTTGCGTTCAG EcoRI 
Reverse AAAAAA CTGCAG -CTACGCATAAGTTTTTACCGTTTTTGG Pst I 

258a Forward AAAAAA GAATTC -GCGAAGGCGGTGGCGCAAGGCGA Eco RI 

Reverse AAAAAA CTGCAG -CTACGCATAAGTTTTTACCGTTTTTGG Pst I 

259 Forward CGC GGATCCCATATG -GAAGAGCTGCCTCCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCTTTTCCGGCGTTT Xhol 

260 Forward CGC GGATCCCATATG -GGTGCGGGTATGGT BamHI- 

Ndel 

Reverse CCC GCTCGAG -AACAGGGCGACACCCT Xhol 

261 Forward AAAAA AGAATTC -CAAGATACAGCTCGGGCATTCGC EcoRI 
Reverse AAAAAA CTGCAG -TCAAACCAACAAGCCTTGGTCACT Pst I 

263 Forward CGC GGATCCCATATG -GCACGTTTAACCGTA BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCGTAAGCCTGCAATT Xhol 

264 Forward AAAAAA GGTACC -GCCGACGCAGTGGTCAAGGCAGAA Kpnl 
Reverse AA ACTGCAG -TCAGCCGGCGGTCAATACCGCCCG Pst I 

265 Forward AAAAA AGAATTC -GCGGAGGTCAAGAGAAGGTGTTTG EcoRI 
Reverse AAAAA ACTGCAG -TTACGAATACGTCGTCAAAATGGG Pst I 

266 Forward AA AGAATTC -CTCATCTTTGCCAACGCCCCCTTC Eco RI 
Reverse AA ACTGCAG -CTATTCCCTGTTGCGCGTGTGCCA Pst I 

267 Forward AA AGAATTC -TTCTTCCGATTCGATGTTAATCG Eco RI 
Reverse AA ACTGCAG -TTAGTAAAAACCTTTCTGCTTGGC Pst I 

269 Forward AA AGAATTC -TGCAAACCTTGCGCCACGTGCCC Eco RI 
Reverse AAA CTGCAG -TTACGAAGACCGCAACGAAAGGCAGAG Pst I 

269a Forward AAAAAA GAATTC -GACTTTATCCAAAACACGGCTTCGCC Eco RI 

Reverse AA ACTGCAG -TTACGAAGACCGCAACGAAAGGCAGAG Pst I 

270 Forward AAA GAATTC -GCCGTCAAGCTCGTTTTGTTGCAATG EcoRI 
Reverse AAA CTGCAG -TTATTCGGCGGTAAATGCCGTCTG Pst I 

271 Forward CGC GGATCCCATATG -CCTGTGTGCAGCTCGAC BamHI- 

Ndel ' 

Reverse CCC GCTCGAG -TCCCAGCCCCGTGGAG Xhol 
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272 Forward AA AGAATTC -ATGACCGCAAAGGAAGAACTGTTCGC 
Reverse AAA CTGCAG -TCAGAGCAGTTCCAAATCGGGGCT 

273 Forward AA AGAATTC -ATGAGTCTTCAGGCGGTATTTATATACCC 
Reverse AA ACTGCAG -TTACGCGTAAGAAAAAACTGC 

274 Forward CGCGGATCCCATATG-ACAGATTTGGTTACGGAC 



276 



Reverse 
Forward 



Reverse 

277 Forward 
Reverse 

277a Forward 
Reverse 

278 Forward 
Reverse 

278a Forward 
Reverse 

279 Forward 

Reverse 

280 Forward 
Reverse 

281 Forward 
Reverse 

281a Forward 
Reverse 

282 Forward 
Reverse 

283 Forward 



CCCG CTCGAG -TTTGCTTTCAGTATTATTGAA 
AAAAAA GAATTC - 

ATGATTTTGCCGTCGTCCATCACGATGATGCG 

AAAAA ACTGCAG -CTACACCACCATCGGCGAATTTATGGC 

AAAAAAGAATTC-ATGCCGCGCTTTGAGGACAAGCTCGTAGG 

AAAAAA CTGCAG -TCATAAGCCATGCTTACCTTCCAACAA 

AAAAA AGAATTC -GGGGCGGCGGCTGGGTTGGACGTAGG 

AAAAA ACTGCAG -TCATAAGCCATGCTTACCTTCCAACAA 

AAAAAA GGTACC -GTCAAAGTTGTATTAATCGGGCCTTTGCC 

AAAAA ACTGCAG -TCATTCAACCATATCAAATCTGCC 

AAAAA AGAATTC -AAAACTCTCCTAATTCGTCATAGTCG 

AAAAA ACTGCAG -TCATTCAACCATATCAAATCTGCC 

CG CGGATCCCATATG -TTGCCTGCAATCACGATT 

CCCG CTCGAG -TTTAGAAGCGGGCGGCAA 

AAAAAA GGTACC -GCCCCCCTGCCGGTTGTAACCAG 

AAAAA ACTGCAG -TTATTGCTTCATCGCGTTGGTCAAGGC 

AAAAAA GAATTC -GCACCCGTCGGCGTATTCCTCGTCATGCG 

AAAAA ATCTAGA -GGTCAGAATGCCGCCTTCTTTGCCGAG 

AAAAA AGAATTC -TCCTACCACATCGAAATTCCTTCCGG 

AAAAA ATCTAGA -GGTCAGAATGCCGCCTTCTTTGCCGAG 

AAAAAA GAATTC -CTTTACCTTGACCTGACCAACGGGCACAG 

AAAAAA CTGCAG -TCAACCTGCCAGTTGCGGGAATATCGT 

CGCGGATCCCATATG-GCCGTCTTTACTTGGAAG 



Reverse CCC GCTCGAG -ACGGCAGTATTTGTTTACG 
284 Forward CGC GGATCCCATATG -TTTGCCTGCAAAAGAATCG 

Reverse CCC GCTCGAG -CCGACTTTGCAAAAACTG 

286 Forward CG CGGATCCCATATG -GCCGACCTTTCCGAAAA 

Reverse CCC GCTCGAG -GAAGCGCGTTCCCAAG 

287 Forward CCG GAATTC T AGCTAGC -CTTTCAGCCTGCGGG 

Reverse CCCG CTCGAG -ATCCTGCTCTTTTTTGCC 

288 Forward CG CGGATCCCATATG -CACACCGGACAGG 

Reverse CCCG CTCGAG -CGTATCAAAGACTTGCGT 
290 Forward CGCGGATCCCATATG-GCGGTTTGGGGCGGA 



EcoRl 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

Kpnl 

PstI 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

Kpnl 

PstI 

Eco RI 

Xbal 

EcoRI 

Xbal 

EcoRI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI- 

Nhel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 
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Ndel 

Reverse CCCG CTCGAG -TCGGCGCGGCGGGC Xhol 
292 Forward CGC GGATCCCATATG -TGCGGGCAAACGCCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTGATTTTTGCGGATGATTT Xhol 

294 Forward AAAAAA GAATTC -GTCTGGTCGATTCGGGTTGTCAGAAC Eco RI 
Reverse AAAAAA CTGCAG -TTACCAGCTGATATAAAACATCGCTTT Pst I 

295 Forward CGC GGATCCCATATG -AACCGGCCGGCCTCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGATATTTGATTCCGTTGC Xhol 

297 Forward AAAAAA GAATTC -GCATACATTGCTTCGACAGAGAG Eco RI 
Reverse AAAAA ACTGCAG -TCAATCCGATTGCGACACGGT Pst I 

298 Forward AAAAAA GAATTC -CTGATTGCCGTGTGGTTCAGCCAAAACCC Eco RI 
Reverse AAAAAACTGCAG-TCATGGCTGTGTACTTGATGGTTGCGT Pst I 

299 Forward CGC GGATCCGCTAGC -CTACCTGTCGCCTCCG BamHI- 

Nhel 

Reverse CCCG CTCGAG -TTGCCTGATTGCAGCGG Xhol 
302 Forward AAAAAA GAATTC -ATGAGTCAAACCGATACGCAACG Eco RI 

Reverse AAAAAACTGCAG-TTAAGGTGCGGGATAGAATGTGGGCGC Pst I 

305 Forward AAAAA AGGTACC -GAATTTTTACCGATTTCCAGCACCGGA Kpn I 
Reverse AAAAA ACTGCAG -TCATTCCCAACTTATCCAGCCTGACAG Pst I 

305a Forward AAAAAA GGTACC -TCCCGTTCGGGCAGTACGATTATGGG Kpn I 

Reverse AAAAA ACTGCAG -TTACAAACCGACATCATGCAGGGTGAA Pst I 

306 Forward CGC GGATCCCATATG -TTTATGAACAAATTTTCCC BamHl- 

Ndel 

Reverse CCCG CTCGAG -CCGCATCGGCAGAC Xhol 
308 Forward CGC GGATCCCATATG -TTAAATCGGGTATTTTATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCCGCCATTCCCTGC Xhol 

311 Forward AAAAAA GGTACC -ATGTTCAGTTTTGGCTGGGTGTTT Kpn I 
Reverse AA ACTGCAG -ATGTTCATATTCCCTGCCTTCGGC Pst I 

312 Forward AAAAA AGGTACC -ATGAGTATCCCATCCGGCGAAATT Kpn I 
Reverse AA ACTGCAG -TCAGTTTTTCATCGATTGAACCGG Pst I 

313 Forward AAAAA AGAATTC -ATGGACGACCCGCGCACCTACGGATC Eco RI 
Reverse AAAAA ACTGCAG -TCAGCGGCTGCCGCCGATTTTGCT Pst I 

401 Forward CGC GGATCCCATATG -AAGGCGGCAACACAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCTTACGTTTTTCAAAGCC Xhol 

402 Forward AAAAAAGAATTC-GTGCCTC AGGC ATTTTC ATTT ACCCTTG C Eco RI 
Reverse AAAAA ATCTAGA -TTAAATCCCTCTGCCGTATTTGTATTC Xba I 

402a Forward AAAAAA GAATTC -AGGCTGATTGAAAACAAACACGG Eco RI 

Reverse AAAAA ATCTAGA -TTAAATCCCTCTGCCGTATTTGTATTC Xba I 

406 Forward CGC GGATCCCATATG -TGCGGGACACTGACAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGGTTGTCCTTGTCTATG Xhol 
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501 


Forward 


CCjCCjCj A 1 CCC A 1 A 1 (j-(jL AGGCGG AG A 1 GGC 


O ^ TTT 

BamHi- 








JNael 




Reverse 




-AJlOl 


502 


Forward 


CGCGGATGCCATATG-GTAGACGCGCTTAAGCA 


BairiHl- 








Ndel 




Reverse 


CuCvju 1 v^O/\u-/\uL. l uLA 1 uuLUuLu 


A1101 


503 


Forward 


CGCGGATCCGATATG-TGTTCGGGGAAAGGCG 


BamHl- 








Nael 




Reverse 




AilOl 


504 


Forward 


CGCGGATCCCATATG-AGCGATATTGAAGTGACG 


BamHI- 








iNdel 




Reverse 


LrCCCAAuC 1 1 - 1 Lr/V 1 1 LAAu 1 LL 1 1 UCCu 


xiinalll 


505 


Forward 


CGCGGATCCCATATG-TTTCGTTTACAATTCAGG 


BamHI- 








Ndel 




Reverse 


r^r^r^r^ p^tp^p- 1 a p - * p^p^p^p^/^t^tt , 'T"t i a t a r*r*r*r* 
LLLut 1 LvjALj-LuuLu 1 1 1 1 A 1 Autuu 


XaOl 


510 


Forward 


CGCGGATCCCATATG-CCTTCGCGGACAC 


BamHI- 








Ndel 




Reverse 


LLLuL 1 LuAu-uLuLAL 1 uuLAuLu 


Xnol 


512 


Forward 


CGCGGATCCCATATG-GGACATGAAGTAACGGT 


BamHI- 








Ndel 




Reverse 


c y r % r*nc* n cc*ci ap a cm a a t a rT^r x T w Tnr^ a /~*pi 
LLLUL 1 LuAu- AuuAA 1 AuCL 1 1 luALu 


Xnol 


515 


Forward 


CGCGGATCCCATATG-GAGGAAATAGCCTTCGA 


BamHI- 








XT J _T 

Ndel 




Reverse 


r s r % ccxc sr vccx api a a a Tnppnr a a apip^atp^ 

LLLUL 1 \^KJj\KJ~/\J\J\ 1 uLLutAAAutA 1 C 


Anoi 


5 lo 


Forward 


c^nc^rm a tppp ata tp tpt a ppttp a THTTPiTnri 

LULuuA 1 tLLA 1 /V 1 O- 1 \j 1 ACvj 1 1 vjr/A lul lul uu 


oanirii- 








1NU.C1 




ixeverse 


ppphptpo a n-TTTnrnnnrr T nr a tp 

vL/L/UU 1 V^VJ/ivJ" 1 X 1 ULUuULUuLn 1 L 


.AJlOl 


tit 
J i / 


Forward 


nan an a Tppr ata tp^ p^f^T a a a np:TnTr;r; a a ata 

LuuUuA 1 uLu/\ 1 /V X \J~\J\J 1 AAAUvJ lul UUAAA 1 /\ 


RamUT 

oanirii- 








1NUC1 




ive verse 


pppriPTP^ a n.ciTnr i ciccc a nrrnT 

UV^V^vJV^ 1 UUAU U 1 UV^UL'VV^AUvLu 1 


VVinT 


Die 


Forward 


A A APA A TTP PPTTTTTT A PTPPTPPP A PPPP A A P^P^ 
/vfVr\VJ/\Al 1 L-uL 1 1111 1 At 1 ut 1 1 LuAt tuuAAuu 


£SCO Kl 




Reverse 


AAA PTPP A P^ TP AAA TTTP A P"* A PTPTPPP A P^ 

AAAt 1 utAu- 1 LAAA 111 LAuAL 1 CI uLLAL 


rSt 1 


519 


Forward 


CGCGGATCCCATATG-TTCAAATCCTTTGTCGTCA 


BamHI- 








Ndel 




Reverse 


r^r^nr^r^TT^r^ a p - " 1 tttpppppttttpptpp 
CUCLrU 1 tuAu" 1 1 1 uuLuu 1 1 1 lutlut 


Xnol 


520 


Forward 


CGCGGATCCCATATG-CCTGCGCTTCTTTCA 


BamHI- 








Ndel 




Reverse 


C y C % C*CiC %r TC K C* AO ATA TTT APA TTTP A r*T/~*f~*r*r* 
LtLuL 1 LuAu-A 1 A 1 1 lAtAl 1 1 LAu 1 luul 


Xnol 


DjL i 


T7 f\ 7 Vk TVn 
J/ UI Wdl Sj. 


rnprtPi ATrrr ATATG.nrr a a a atpt at apptPtP 


j_>«iixxtx- 








Ndel 




Reverse 


CCCGCTCGAG-CATACGCCCCAGTTCC 


Xhol 


522 


Forward 


CGCGGATCCCATATG-ACTGAGCCGAAACAC 


BamHI- 








Ndel 




Reverse 


GCCCAAGCTT-TTCTGATTTCAAATCGGCA 


Hindlll 


523 


Forward 


CGCGGATCCCATATG-GCTCTGCTTTCCGCG 


BamHI- 








Ndel 
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Reverse CCCG CTCGAG - AGGGTGTGTG AT AAT AAGAAG Xhol 

525 Forward CGC GGATCCCATATG -GCCGAAATGGTTCAAATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCCCGTGCATATCATAAA Xhol 

527 Forward AAA GAATTC -TTCCCTCAATGTTGCCGTTTTCG Eco RI 

Reverse AAA CTGCAG -TTATGCTAAACTCGAAACAAATTC PstI 

529 Forward CG CGGATCCGCTAGC -TGCTCCGGCAGCAAAAC BamHI- 

Nhel 

Reverse GCCC AAGCTT -ACGCAGTTCGGAATGGAG Hindlll 

530 Forward CGC GGATCCCATATG -AGTGCGAGCGCGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGACCGACTGATTCCG Xhol 

531 Forward AAAAA AGAATTC -TATGCCGCCGCCTACCAAATCTACGG EcoRI 
Reverse AAAAA ACTGCAG -TTAAAACAGCGCCGTGCCGACGACAAG PstI 

532 Forward AAAAA AGAATTC -ATGAGCGGTCAGTTGGGCAAAGGTGC Eco RI 
Reverse AAAAAA CTGCAG -TCAGTGTTCCAAGTGGTCGGTATCAAA PstI 

532a Forward AAAAAA GAATTC -TTGGGTGTCGCGTTTGAGCCGGAAGT Eco RI 

Reverse AAAAA ACTGCAG -TCAGTGTTCCAAGTGGTCGGTATCAAA PstI 

535 Forward AAA GAATTC -ATGCCCTTTCCCGTTTTCAGAC Eco RI 

Reverse AA ACTGCAG -TCAGACGACCCCGCCTTCCCC PstI 

537 Forward CG CGGATCCCATATG -CATACCCAAAACCAATCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCCTGCAAATAAAGGGTT Xhol 

538 Forward CGC GGATCCCATATG -GTCGAGCTGGTCAAAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TGGCATTTCGGTTTCGTC Xhol 

539 Forward CGC GGATCCGCTAGC -GAGGATTTGCAGGAAA BamHI- 

Nhel 

Reverse CCC GCTCGAG -TACCAATGTCGGCAAATC Xhol 

542 Forward AAA GAATTC -ATGCCGTCTGAAACCGTGTC Eco RI 
Reverse AAA CTGCAG -TTACCGCGAACCGGTCAGGAT PstI 

543 Forward AAAAA AGAATTC -GCCTTCGATGGCGACGTTGTAGGTAC Eco RI 
Reverse AAAAA ATCTAGA - Xbal 

TTAATGAAGAAGAACATATTGGAATTTTGG 

543a Forward AAAAA AGAATTC -GGCAAAACTCGTCATGAATTTGC Eco RI 

Reverse AAAAA ATCTAGA - Xbal 
TTAATGAAGAAGAACATATTGGAATTTTGG 

544 Forward AA AGAATTC -GCGCCCGCCTTCTCCCTGCCCGACCTGCACGG Eco RI 
Reverse AAA CTGCAG -CTATTGCGCCACGCGCGTATCGAT PstI 

544a Forward AAAAA AGAATTC - Eco RI 
GCAAATGACTATAAAAACAAAAACTTCCAAGTACTTGC 

Reverse AA ACTGCAG -CTATTGCGCCACGCGCGTATCGAT PstI 

547 Forward AA AGAATT C-ATGTTCGTAGATAACGGATTTAATAAAAC Eco RI 
Reverse AA ACTGCAG -TTAACAACAAAAAACAAACCGCTT PstI 

548 Forward AAAGAATTC-GCCTGCAAACCTCAAGACAACAGTGCGGC Eco RI 
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Reverse 


AAA ptpp a n Tr aha nr a nnp/rr^PTT apa Tr*rinr* 


Pot T 
rSi i 


55U 


Forward 


A A A A A A f!Tm A P 


Col T 

oai i 






a tp ata a ppp apa rr'TTTr* a tptptttp a ttttpp 
A i uA I AACLrUAU AOvj ill LA 1 CI CI 1 1 L, A I 1 I I CC 






Reverse 


A A APTHPAn TTAPPPA A A PPIPTP^P A A A A TPPPP 


Pot T 

r ST 1 


C C Art 


Forward 


A A A A A APA ATTP p T TA A A TP A PPlP*PTTTP^P A PTPPP A A A PP.P, 


xiCO Kl 




Reverse 


A A A PTPP A Cl TTAPPPA A A PPiP^TP.f"' A A A ATPPPP 


Pcf T 
r^St 1 


552 


Forward 


A A AAA APAA TTp TTriPPP^PPTTPPPTHP ATAP 
AAAAAAuAAl IL-I 1 VjOCvjCvj 1 luOLluuAlAL 






Reverse 


AAACTGCAG-TTATTTCTGATGCCTTTTCCCAAC 


Pst I 


554 


Forward 


CGCGGATCCCATATG-TCGCCCGCGCCCAAC 


BamHI- 








Ndel 




Reverse 


r^r^r^r^c^Tf^n a pi ptpppptptp apap ap 
LLtut 1 LuAu-L 1 uLLL I vj 1 tAuALAt 


Xnol 


556 


Forward 


A A APA ATTP PPPPPPPPTTTTPTTTPP A P A TPPPP 

AAAvjAAl 1C-UCvjCjvjCCjo1 1 1 lUl 1 IOvjACAICCCCj 


h-co Rl 




Reverse 


AAA PTPP A P TT A A r^/^/^TT^r^/^f^ A PPTTTPTP A PP 


Fst I 


557 


Forward 


y^i^/^PP A TPPP ATA TP* TPPPPTTTPP A PPTP A A 

CGCGGATCCLATATCj-TGCGuTTTCCACCTGAA 


BamHI- 








Ndel 




Reverse 


r^r^r^r^r^TT^r^ a p ttpppppttp a p a a a pp 
CCCGL- 1 CLrALr- 1 1 CCCjCL 1 1 LALtAAALtCj 


Xnol 


558 


Forward 


AAAGAATTC-GAGCTTTATATGTTTCAACAGGGGACGGC 


Eco Rl 




Reverse 


aaactgcag-ctaaacaatgccgtctgaaagtggaga 


Pst I 


558a 


Forward 


aaaaaagaattc-attagattctatcgccataaacagacggg 


EcoRI 




Reverse 


aaaaaactgcag-ctaaacaatgccgtctgaaagtggaga 


Pst I 


560 


Forward 


aaaaaagaattc- 


Eco Rl 






tcgcctttccgggacggggcgcacaagatggc 






Reverse 


aaaaaactgcag-tcatgcggtttcagacggcattttggc 


Pst I 


561 


Forward 


ccggaattctacatatg-atactgccagcccgt 


EcoRI- 








Ndel 




Reverse 


cccgctcgag-tttcaagctttcttcagatg 


Xhol 


562 


Forward 


cgcggatcccatatg-gcaagcccgtcgag 


BamHI- 








Ndel 




Reverse 


cccgctcgag-agaccaactccaactcgt 


Xhol 


565 


Forward 


cgcggatcccatatg-aagtcgagcgcgaaatac 


BamHI- 








Ndel 




Reverse 


cccgctcgag-ggcattgatcggcggc 


Xhol 


566 


Forward 


cgcggatcccatatg-gtcggtggcgaagagg 


BamHI- 








Ndel 




Reverse 


cccgctcgag-cgcatgggcgaagtca 


Xhol 


567 


Forward 


ccggaattctacatatg-agtgcgaacatccttg 


EcoRI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCCCCGACACCCTCG 


Xhol 




rorwaTu 


c*ac*na a tppp ata to-ptp a PtPHtp aha pp 


D am U| 

oainrii- 




Reverse 


cccgctcgag-cggcgcggcgttcag 


Ndel 
Xhol 


569 


Forward 


AAAAAAGAATTC-CTGATTGCCTTGTGGGAATATGCCCG 


EcoRI 




Reverse 


AAAAAACTGCAG-TTATGCATAGACGCTGATAACGGCAAT 


Pst I 


570 


Forward 


CGCGGATCCCATATG-GACACCTTCCAAAAAATCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGGGCGTTCATTTCTTT 


Xhol 
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571 


Forward 


AAAAAAGAATTC- 

ATGGGTATTGCCGGCGCCGTAAATGTTTTGAACCC 


EcoRI 




Reverse 


AAAAAACTGCAG-TTATGGCCGACGCGCGGCTACCTGACG 


PstI 


572 


Forward 


CGCGGATCCCATATG-GCGCAAAAAGGCAAAACC 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-GCGCAGTGTGCCGATA 


Xhol 


573 


Forward 


CGCGGATCCCATATG-CCCTGTTTGTGCCG 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-GACGGTGTCATTTCGCC 


Xhol 


574 


Forward 


CGCGGATCCCATATG-TGGTTTGCCGCCCGC 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-AACTTCGATTTTATTCGGG 


Xhol 


575 


Forward 


CGCGGATCCCATATG-GTTTCGGGCGAGG 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-CATTCCGAATCTGAACAG 


Xhol 


576 


Forward 


CGCGGATCCCATATG-GCCGCCCCCGCATCT 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-ATTTACTTTTTTGATGTCGAC 


Xhol 


577 


Forward 


CGCGGATCCCATATG-GAAAGGAACGGTGTATTT 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-AGGCTGTTTGGTAGATTCG 


Xhol 


578 


Forward 


CGCGGATCCCATATG-AGAAGGTTCGTACAG 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-GCCAACGCCTCCACG 


Xhol 


579 


Forward 


CGCGGATCCCATATG-AGATTGGGCGTTTCCAC 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-AGAATTGATGATGTGTATGT 


Xhol 


580 


Forward 


CGCGGATCCCATATG-AGGCAGACTTCGCCGA 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-CAC I TCCCCCGAAGTG 


Xhol 


581 


Forward 


CGCGGATCCCATATG-CACTTCGCCCAGC 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-CGCCGTTTGGCTTTGG 


Xhol 


582 


Forward 


AAAAAAGAATTC-TTTGGAGAGACCGCGCTGCAATGCGC 


Eco RI 




Reverse 


AAAAAATCTAGA-TCAGATGCCGTCCCAGTCGTTGAA 


Xba I 


583 


Forward 


AAAAAAGAATTC-ACTGCCGGCAATCGACTGCATAATCG 


EcoRI 




Reverse 


AAAAAACTGCAG-TTAACGGAGGTCAATATGATGAAATTG 


PstI 


584 


Forward 


AAAAAAGAATTC- 


EcoRI 




Reverse 


AAAAAACTGCAG-TCAGAACTGAACCGTCCCATTGACGCT 


PstI 


585 


Forward 


AAAAAAGGTACC-TCTTTCTGGCTGGTGCAGAACACCCTTGC 


EcoRI 




Reverse 


AAAAAACTGCAG-TCAGTTCGCACTTTTTTCTGTTTTGGA 


PstI 


586 


Forward 


CGCGGATCCCATATG-GCAGCCCATCTCG 


BamHI- 
Ndel 




Reverse 


CCCGCTCGAG-TTTCAGCGAATCAAGTTTC 


Xhol 


587 


Forward 


CGCGGATCCCATATG-GACCTGCCCTTGACGA 


BamHI- 
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Ndel 

Reverse CCCG CTCGAG -AAATGTATGCTGTACGCC Xhol 

588 Forward AAAAA AGAATTC -GCCGTCCTGACTTCCTATCAAGAACCAGG EcoRI 
Reverse AAAAAA CTGCAG -TTATTTGTTTTTGGGCAGTTTCACTTC PstI 

589 Forward AAAAAAGAATTC- EcoRI 

ATGCAACAAAAAATCCGTTTCCAAATCGAAGG 
Reverse AAAAAA CTGCAG -CTAATCGATTTTTACCCGTTTCAGGCG PstI 

590 Forward AAAAA AGAATTC -ATGAAAAAACCTTTGATTTCAGTTGCGGC EcoRI 
Reverse AAAAAA CTGCAG- TTACTGCTGCGGCTCTGAAACCAT PstI 

591 Forward AAAAAA GAATTC -CACTACATCGTTGCCAGATTGTGCGG Eco RI 
Reverse AAAAA ACTGCAG -CTAACCGAGCAGCCGGGTAACGTCGTT PstI 

592a Forward AAAAAAGAATTC-CGCGATTACACCGCCAAGCTGAAAATGGG EcoRI 



Reverse AAAAAA CTGCAG -TTACCAAACGTCGGATTTGATACG PstI 
593 Forward CGC GGATCCGCTAGC -CTTGAACTGAACGGACTC BamHI- 

Nhel 

Reverse CCCG CTCGAG -GCGGAAGCGGACGATT Xhol 
594a Forward AAAAAA GAATTC -GGTAAGTTCGCCGTTCAGGCCTTTCA EcoRI 
Reverse AAAAA ACTGCAG -TTACGCCGCCGTTTCCTGACACTCGCG PstI 

595 Forward AAAAAA GAATTC -TGCCAGCCGCCGGAGGCGGAGAAAGC EcoRI 
Reverse AAAAAA CTGCAG -TTATTTCAAGCCGAGTATGCCGCG PstI 

596 Forward CGC GGATCCCATATG -TCCCAACAATACGTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGCGTTACCGGTTTGT Xhol 

597 Forward CGC GGATCCCATATG -CTGCTTCATGTCAGC BamHI- 

Ndel 

Reverse GCCCAAGCTT-ACGTATCCAGCTCGAAG Hindlll 

601 Forward CGC GGATCCCATATG -ATATGTTCCCAACCGGCAAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAAACAATCCTCAGGCAC Xhol 

602 Forward CGC GGATCCGCTAGC -TTGCTCCATCAATGC BamHI- 

Nhel 

Reverse CCCG CTCGAG- ATGCAGCTGCTAAAAGCG Xhol 

603 Forward AAAAA AGAATTC -CTGTCCTCGCGTAGGCGGGGACGGGG EcoRI 
Reverse AAAAA ACTGCAG -CTACAAGATGCCGGCAAGTTCGGC PstI 

604 Forward CGC GGATCCGCTAGC -CCCGAAGCGCACTT BamHI- 

Nhel 

Reverse CCC GCTCGAG -GACGGCATCTGCACGG Xhol 
606a Forward AAAAAA GAATTC -CGCGAATACCGCGCCGATGCGGGCGC Eco RI 
Reverse AAAAA ACTGCAG -TTAAAGCGATTTGAGGCGGGCGATACG PstI 

607 Forward AAAAAAGAATTC- ATGCTGCTCGACCTCAACCGCTTTTC Eco RI 
Reverse AAAAA ACTGCAG -TCAGACGGCCTTATGCGATCTGAC PstI 

608 Forward AAAAA AGAATTC -ATGTCCGCCCTCCTCCCCATCATCAACCG Eco RI 
Reverse AAAAAA CTGCAG- TTAGTCTATCCAAATGTCGCGTTC PstI 

609 Forward CGC GGATCCCATATG -GTTGTGGATAGACTCG BamHI- 

Ndel 
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Reverse CCC GCTCGAG -CTGGATTATGATGTCTGTC Xhol 

610 Forward CGC GGATCCCATATG -ATTGGAGGGCTTATGCA BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGCTTCAACATCTTTGCC Xhol 

611 Forward CGC GGATCCCATATG -CCGTCTCAAAACGGG BamHI- 

Ndel 

Reverse CCC GCTCGAG -AACGACTTTGAACGCGCAA Xhol 

613 Forward CGC GGATCCCATATG -TCGCGTTCGAGCCG3 BamHI- 

Ndel 

Reverse CCC GCTCGAG -AGCCTGTAAAATAAGCGGC Xhol 

614 Forward CGC GGATCCCATATG -TCCGTCGTGAGCGGC BamHI- 

Ndel 

Reverse CCC GCTCGAG -CCATACTGCGGCGTTC Xhol 
616 Forward AAAAAA GAATTC -ATGTCAAACACAATCAAAATGGTTGTCGG EcoRI 

Reverse AAAAA ATCTAGA -TTAGTCCGGGCGGCAGGCAGCTCG Xbal 
619a Forward AAAAAA GAATTC -GGGCTTCTCGCCGCCTCGCTTGC EcoRI 

Reverse AAAAA ACTGCAG -TCATTTTTTGTGTTTTAAAACGAGATA Pst I 
622 Forward CGC GGATCCCATATG -GCCGCCCTGCCTAAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGTCCAAATGATAAATCTG Xhol 

624 Forward CGC GGATCCCATATG -TCCCCGCGCTTTTACCG BamHI- 

Ndel 

Reverse CCCG CTCGAG- AGATTCGGGCCTGCGC Xhol 

625 Forward CGC GGATCCCATATG -TTTGCAACCAGGAAAATG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGGCAAAATTACCGCCTT Xhol 
627a Forward AAAAAA GAATTC- AAAGCAGGCGAGGCAGGCGCGCTGGG Eco RI 

Reverse AAAAAA CTGCAG - Pst I 

TTACGAATGAAACAGGGTACCCGTCATCAAGGC 
628 Forward AAAAA AGGTACC- GCCTTACAAACATGGATTTTGCGTTC Kpnl 

Reverse AAAAAA CTGCAG -CTACGCACCTGAAGCGCTGGCAAA Pst I 

629a Forward AAAAA AGAATTC -GCCACCTTTATCGCGTATGAAAACGA Eco RI 

Reverse AAAAA ACTGCAG -TTACAACACCGCCGTCCGGTTCAAACC Pst I 
630a Forward AAAAA AGAATTC -GCGGCTTTGGGTATTTCTTTCGG Eco RI 

Reverse AAAAA ACTGCAG -TTAGGAGACTTCGCCAATGGAGCCGGG Pst I 
635 Forward AAAAA AGAATTC - Eco RI 

ATGACCCAGCGACGGGTCGGCAAGCAAAACCG 

Reverse AAAAA ACTGCAG -TTAATCCACTATAATCCTGTTGCT Pst I 

638 Forward AAAAA AGAATTC -ATGATTGGCGAAAAGTTTATCGTAGTTGG Eco RI 
Reverse AAAAA ACTGCAG -TCACGAACCGATTATGCTGATCGG Pst I 

639 Forward CGC GGATCCCATATG -ATGCTTTATTTTGTTCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -ATCGCGGCTGCCGAC Xhol 
642 Forward CGC GGATCCCATATG -CGGTATCCGCCGCAAT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGATTGCGGGGCATTA Xhol 
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643 Forward CGC GGATCCCATATG -GCTTCGCCGTCGGCAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACCGAAAAACAGACCGC Xhol 

644 Forward AAAAA AGAATTC - Eco RI 

ATGCCGTCTGAAAGGTCGGCGGATTGTTGCCC 

Reverse AAAAA ATCT AG A -CT ACCCGC AATATCGGC AGTCC A A T A T Pst I 

645 Forward AAAAAAGAATTC-GTGGAACAGAGCAACACGTTAAATCG Eco RI 
Reverse AAAAAACTGCAG-CTACGAGGAAACCGAAGACCAGGCCGC Pst I 

647 Forward AAAAAAGAATTC-ATGCAAAGGCTCGCCGCAGACGG Eco RI 
Reverse AAAAA ACTGC AG -TTAGATTATC AGGGAT ATCCGGT A G A A Pst I 

648 Forward AAAAA AGAATTC - Eco RI 

ATGAACAGGCGCGACGCGCGGATCGAACG 

Reverse AAAAAACTGCAG-TCAAGCTGTGTGCTGATTGAATGCGAC Pst I 

649 Forward AAAAAAGAAJTC-GGTACGTCAGAACCCGCCCACCG Eco RI 
Reverse AAAAAACTGCAG-TTAACGGCGGAAACTGCCGCCGTC Pst I 

650 Forward AAAAAAGAAJTC-ATGTCCAAACTCAAAACCATCGC Eco RI 
Reverse AAAAAACTGCAG-TCAGACGGCATGGCGGTCTGTTTT Pst I 

652 Forward AAAAA AGGTACC - Kpn I 

GCTGCCGAAGACTCAGGCCTGCCGCTTTACCG 

Reverse AAAAAACTGCAG-TTATTTGCCCAGTTGGTAGAATGCGGC Pst I 

653 Forward AAAAAAGAATTC-GCGGCTTTGCCGGTAATTTTCATCGG Eco RI 
Reverse AAAAA ACTGCAG -CTATGCCGGTCTGGTTGCCGGCGGCGA Pst I 

656a Forward AAAAAAGAATTC-CGGCCGACGTCGTTGCGTCCTAAGTC Eco RI 

Reverse AAAAAACTGCAG-CTACGATTTCGGCGATTTCCACATCGT Pst I 

657 Forward AAAAAAGAATTC-GCAGAATTTGCCGACCGCCATTTGTGCGC Eco RI 
Reverse AAAAA ACTGCAG -TTATAGGGACTGATGCAGTTTTTTTGC Pst I 

658 Forward CGC GGATCCCATATG -GTGTCCGGAATTGTG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCAGAATGTTTACCGTT Xhol 

661 Forward AAAAA AGAATTC - Eco RI 
ATGCACATCGGCGGCTATTTTATCGACAACCC 

Reverse AAAAAACTGCAG-TCACGACGTGTCTGTTCGCCGTCGGGC Pst I 

663 Forward CGC GGATCCCATATG -TGTATCGAGATGAAATT BamHI- 

Ndel 

Reverse CCC GCTCGAG -GTAAAAATCGGGGCTGC Xhol 

664 Forward CGC GGATCCCATATG -GCGGCTGGCGCGGT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AAATCGAGTTTTACACCAC Xhol 

665 Forward AAAAAAGAATTC-ATGAAATGGGACGAAACGCGCTTCGG Eco RI 
Reverse AAAAAACTCCAG-TCAATCCAAAATTTTGCCGACGATTTC Pst I 

666 Forward AAAAAAGAATTC-AACTCAGGCGAAGGAGTGCTTGTGGC Eco RI 
Reverse AAAAAATCTAGA-TCAGTTT AGGGAT AGCAGGCGT AC Xbal 

667 Forward AAAAAA GAATTC - Eco m 

CCGCATCCGTTTGATTTCCATTTCGTATTCGTCCG 

Reverse AAAAAACTGCAG-TTAATGACACAATAGGCGC A AGTP p st i 
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ooy 


Forward 


AAAAAAOAA 1 I L^-A 1 Ov^OOOOOA 1 l^A 1 1 AAAAAAOAOOAOOO 


bco KJ 




Reverse 


A A A A A A C^ r Tf^.C % A TT A A /~"T A T/^/^/^TTT/*^* A T^/^T 1 /^/^ r~*r^ 

AAAAAAL 1 ULAu- 1 1 AOAO 1 A 1 LLu 111 OA 1 G 1 CGGC 


Pst I 


670a 


Forward 


AAAAAAGAATTC-AAAAACGCTTCGGGCGTTTCGTCTTC 


EcoRI 




Reverse 


AAAAAACTGCAG- 


Pst I 






TTAGGAGCTTTTGGAACGCGTCGGACTGGC 




O/l 


Forward 


OOOOO A 1 v^v^A 1 A 1 Vj- AOv^ AOO AOOO 1 AAL 


T*J TTT 

BamHI- 








JNaei 




V vX 


CCCGCTCGAG-AGCAACTATAAAAACGrAAG 


WinT 


filJ 


J7 \Jl W Ol U 


CGCGGATCCCATATG-AGGAAAATCCGCACC 

V_; VJ V> VJ VJi \ X V-' \~> V^-Tv X -fV X VJ /aVJVJ/Tut-TV/I X VxV-/VJ V//J.V/V/ 


OdlTlxll- 








INacl 




1VCVCI oC 


GrCGCTCGAG-ACGGGATAOGCGOTTO 

V/VvVvVJV/ 1 V^VJ/i.VJ -T^V_^ VJ VJ VJ/i. 1 /^VJ VJ V^ VJ Vj J. J. VJ 




O / J> 


r VJi W til vJ. 


A A A A A AO A ATTP- ATOHAT ATTH A A APPTTrrTTHP A 


J3CO xvl 




XVCVCIoC 


A A A A A A CTGC A G-PT A C A A A PPP A nPTP^Pnr 1 A G(lr A A 
^Tj^^T-T\-rtjrvv-/ x vJ v^-^vvj v^ l Av^AjTj\VyV/vAvJv/ 1 v^vj v^ vJv^/\vJvJf/VrV 


Pot T 

.T SI 1 




rorvvciru 


aaaaaahaa ttg~ A tg AAAAfA ^rv^pr^r , pr;rv^r;TTr , r , r , r; 


xiCO Jtvl 




Reverse 


AAAAAAt 1 OUAO- 1 OAAOOOCO 111 OGOC 1 CGTCGGG 


Pst I 


675 


Forward 


CGCGOA 1 COCA 1 A 1 O-AACACCATCGCCCC 


BamHI- 








JNael 




JVCVClbC 


PGCGGTGG AG-TTGTTCGTrTTr A A ACTOT 

\^Vw^V_/Vji\-/ A V_^VJ^VVJ 1 X X X \_/Vj X X X V^AiVn.V' X V_J x 


VU«T 


O / / d 


r UlWalU 


A A A A A AGAATTC-AGACGGCATTCCCGATGAGTCGATTTTC^A 

/A^fA-rV.fVrt^rVVj/^rV 1 x Vv x^VJ-^V^ VJ VJV_//-V 1 1 V^V^V^VJ<r\. 1 v*aU 1 V_, vJiY 1111 vJ/\ 


HCO rvl 




Reverse 


A A A A A A f~ tr T(~ic* A f~i TT A oot A tooooo aaa too a r^r^r^r+r^r^r* 
AAAAAAt 1 vjL/Avj- 1 1 ALu 1 A 1 OOOCOAAA 1 OOACOOCCOC 


Pst I 


680 


Forward 


r^r^ r^r^r^ a toooot a r^r* a r^r^ a a a i "mnp 
CCjCCjOA 1 CUOC 1 ALrL- ACGAACjCjCjC AG iTCGG 


~r% T TT 

BamHI- 








JNliel 




Reverse 


CUCijrU 1 OvjALr-UA 1 LAAAAALt 1 ULLuL 


Xnol 


681 


Forward 


AAAAAAvj AA 1 1 A 1 (j ACG ACGCCGATGGC AATC AGTGC 


Eco RI 




Reverse 


AAAAAAL 1 OC AO- 1 1 AOOO 1 CTTCCOC AAAAAAC AGC 


T% J- T 

Pst I 


683 


Forward 


CGCGGATCCCATATG-TGCAGCACACCGGACAA 


BamHI- 








Ndel 




Reverse 


OOOOOlOOAO-OAOl 11111 1CCOOA1ACG 


Xhol 


684 


Forward 


CGCGGATCCCATATG-TGCGGTACTGTGCAAAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTCGACCATCTGTTGCG 


Xhol 


685 


Forward 


CGCGGATCCCATATG-TGTTTGCTTAATAATAAACATT 


BamHI- 








Ndel 




Reverse 


LLLut 1 LuAu-t 11111 CCCCOCOOCA 


Xhol 


686 


Forward 


CGCGGATCCCATATG-TGCGGCGGTTCGGAAG 


BamHI- 








Ndel 




Reverse 


LLLuL 1 LuAu-LA 1 1 LtuA 1 1 C 1 OA 1 OAAO 


Xhol 


687 


Forward 


CGCGGATCCCATATG-TGCGACAGCAAAGTCCA 


BamHI- 








NdeT 




Reverse 


CCCGCTCGAG-CTGCGCGGCTTTTTGTT 


Xhol 


690 


Forward 


CGCGGATCCCATATG-TGTTCTCCGAGCAAAGAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TATTCGCCCCGTGTTTGG 


Xhol 


691 


Forward 


CGCGGATCCCATATG-GCCACGGCTTATATCCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGAGGCAGGAAGAAAG 


Xhol 



WO 99/57280 



PCT/US99/09346 



159 



694 


Forward 


CGCGGATCCCATATG-TTGGTTTCCGCATCCGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCTGCGTCGGTGCGGT 


Xhol 


695 


Forward 


CGCGGATCCCATATG-TTGCCTCAAACTCGTCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCGTTTGCGCACGGCT 


Xhol 


696 


Forward 


CGCGGATCCCATATG-TTGGGTTGCCGGCAGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTGATTGCCGCAATGATG 


Xhol 


700a 


Forward 


AAAAAAGAATTC-GCATCGACAGACGGTGTGTCGTGGAC 


EcoRl 




Reverse 


AAAAAACTGCAG-TTACGCTACCGGCACGACTTCCAAACC 


PstI 


701 


Forward 


CGCGGATCCCATATG-AAGACTTGTTTGGATACTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TGCCGACAACAGCCTC 


Xhol 


702 


Forward 


AAAAAAGAATTC-ATGCCGTGTTCCAAAGCCAGTTGGATTTC 


EcoRI 




Reverse 


AAAAAACTGCAG-TTAACCCCATTCCACCCGGAGAACCGA 


PstI 


703 


Forward 


CGCGGATCCGCTAGC-CAAACGCTGGCAACCG 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-TTTTGCAGGTTTGATGTTTG 


Xhol 


704a 


Forward 


AAAAAAGAATTC-GCTTCTACCGGTACGCTGGCGCG 


EcoRI 




Reverse 


AAAAAACTGCAG- 


PstI 






TTAGTTTTGCCGGATAATATGGCGGGTGCG 




707 


Forward 


CGCGGATCCGCTAGC-GAAATTATTAACGATGCAGA 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-GAAACTGTAATTCAAGTTGA 


Xhol 


708 


Forward 


CGCGGATCCGCTAGC-CCTTTTAAGCCATCCAAAA 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-TTGACCGGTGAGGACG 


Xhol 


710 


Forward 


CGCGGATCCCATATG-GAAACCCACGAAAAAATC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AACGGTTTCGGTCAG 


Xhol 


714 


Forward 


CGCGGATCCCATATG-AGCTATCAAGACATCTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGGTAGGTAAATCGGAT 


Xhol 


716 


Forward 


CGCGGATCCCATATG-GCCAACAAACCGGCAAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTAGAACCGCATTTGCC 


Xhol 


718 


Forward 


CGCGGATCCCATATG-GAGCCGATAATGGCAAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GGCGCGGGCATGGTCTTGTCC 


Xhol 


720 


Forward 


CGCGGATCCCATATG-AGCGGATGGCATACC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTTGCATAGCTGTTGACCA 


Xhol 


723 


Forward 


CGCGGATCCCATATG-CGACCCAAGCCCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AATGCGAATCCGCCGCC 


Xhol 
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725 


Forward 


CGCGGATCCCATATG-GTGCGCACGGTTAAA 


BamHI- 








Ndel 




Reverse 


LLLuL 1 CLiALr- 1 1 UL 1 I A 1 LL 1 1 AAajLKj 1 1 A 


Xnol 


726 


Forward 


CGCGGATCCCATATG-ACCATCTATTTCAAAAAC 


BamHI- 








Ndel 




Reverse 


LLLuL 1 CCjA<jt-OUU<jA 1 vj 1 1 1 AOUCj 1 LC 


Xhol 


728 


Forward 


CGCGGATCCCATATG-TTTTGGCTGGGAACGGG 


BamHI- 








Ndel 




Reverse 


CCCCjC I CCjACj-Cj I OACjAAALtCj 1 CCjCCjC 


Xhol 


729 


Forward 


CGCGGATCCCATATG-TGCACCATGATTCCCCA 


BamHI- 








Ndel 




Reverse 


GCCCAACjC 1 1 -1 TTGTCGGTTTGGGTATC 


Hindlll 


731 


Forward 


CGCGGATCCGCTAGC-GCCGTGCCGGAGG 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-ACGGGCGCGGCAG 


Xhol 


732 


Forward 


CCGGAATTCTACATATG-TCGAAACCTGTTTTTAAGAA 


EcoRI- 








Ndel 




Reverse 


CCCGCTCGAG-CTTCTTATCTTTTTTATCTTTC 


Xhol 


733 


Forward 


CGCGGATCCCATATG-GCCTGCGGCGGCAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-1 CGC T TGCCTCCTTTAC 


Xhol 


734 


Forward 


CGCGGATCCCATATG-GCCGATACTTACGGCTAT 


BamHI- 








Ndel 




Reverse 


CCCGC I CGAO- 111 OACjA 1111 OAA rc AAAGAG 


Xhol 


735 


Forward 


CGCGGATCCCATATG-AAGCAGCAGGCGGTCA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AT T 1 CGGTAGCCGAGGG 


Xhol 


737 


Forward 


CGCGGATCCCATATG-CACCACGACGGACACG 


BamHI- 








Ndel 




Reverse 


CCCGC 1 COA(jr-Cr 1 CLf 1 CCjCCjCjCGGGA 


Xhol 


739 


Forward 


CGCGGATCCCATATG-GCAAAAAAACCGAACA 


BamHI- 








Ndel 




Reverse 


CCL/UU 1 CvjA<j-(jAA<jtALj 1 1 id 1 CCjACj AA 1 1 


Xhol 


740 


Forward 


CGCGGATCCCATATG-GCCAATCCGCCCGAAG 


BamHI- 








Ndel 




Reverse 


r , r , nr , PTr , p ap a a a pprppf a a a a t a /^t^ 
CCUCjC 1 UUAO-AAAULrCijrUL/AAAA 1 AG 1 kj 


Xhol 


741 


Forward 


CGCGGATCCCATATG-TGCAGCAGCGGAGGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTGCTTGGCGGCAAGGC 


Xhol 


743 


Forward 


CGCGGATCCCATATG-GACGGTGTTGTGCCTGTT 


BamHI- 








Ndel 




Reverse 


cccgctcgag-cttacggatcaaattgacg 


Xhol 


745 


Forward 


cgcggatcccatatg-ttttggcaactgaccg 


BamHI- 








Ndel 




Reverse 


cccgctcgag-caaatcagatgcctttagg 


Xhol 


746 


Forward 


cgcggatcccatatg-tccgaaaacaaacaaaac 


BamHI- 
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Nael 




Reverse 


CCCCjC 1 LuAu- 1 1 LA 1 1 CO 1 1 ACC 1 GACC 


Xhol 


747 


Forward 


CCGGAATTCTAGCTAGC-CTGACCCCTTGGG 


EcoRI- 








Nhel 




Reverse 


ULtLAAuL 1 1 - 1 1 i 1 CjA 1111 AA 1 1 GAG 1 A 1 AGAAG 


T T " JTTT 

HmdIII 


749 


Forward 


CGCGGATCCCATATG-TGCCAGCCGCCG 


BamHI- 








Ndel 




Reverse 


CGCGG 1 GGAG- 111 C AAGCGG Au 1 Al GG 


Xhol 


750 


Forward 


CGCGGATCCCATATG-TGTTCGCCCGAACCTG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTTTTTCCCCGCCGCAA 


Xhol 


758 


Forward 


CGCGGATCCCATATG-AACAATCTGACCGTGTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TGGCTCAATCCTTTCTGC 


Xhol 


759 


Forward 


CGCGGATCCGCTAGC-CGCTTCACACACACCAC 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-CCAGTTGTAGCCTATTTTG 


Xhol 


763 


Forward 


CGCGGATCCCATATG-CTGCCTGAAGCATGGCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTCCGCAAATACCGTTTCC 


Xhol 


764 


Forward 


CGCGGATCCCATATG-TTTTTCTCCGCCCTGA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCGCTGCCTAAAGCTTTC 


Xhol 


765 


Forward 


CGCGGATCCCATATG-TTAAGATGCCGTCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ACGCCGACGTTTTTTATTAA 


Xhol 


767 


Forward 


y - -* y^* y~^ * * % y— • y** y—w a ■ ■ ■ a ■ ft ■ * y^* i y*- y"-\ y—*t A A y*** V"1 y™» y— * * * 

CGCGGATCCCATATG-CTGACGGAAGGGGAAG 


BamHI- 








Ndel 




Reverse 


CGCGCTCGAG-TTTC 1 G 1 AGAGCAGGGG 


Xhol 


768 


Forward 


CGCGGATCCCATATG-GCCCCGCAAAAACCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCATCCCTTTTTTGAGC 


Xhol 


770 


Forward 


CGCGGATCCCATATG-TGCGGCAGCGGCGAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGTTTGTCGAGATTTTC 


Xhol 


771 


Forward 


CGCGGATCCCATATG-TCCGTATATCGCACCTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CGGTTCTTTAGGTTTGAG 


Xhol 


IT) 
/ /Z 


rorwarci 




r>anirll- 








Ndel 




Reverse 


CCCGCTCGAG-CAATGCCGACATCAAACG 


Xhol 


774 


Forward 


CGCGGATCCCATATG-TCCGTTTCACCCGTTCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCGTTTGCGCACGGCT 


Xhol 


790 


Forward 


CGCGGATCCCATATG-GCAAGAAGGTCAAAAAC 


BamHI- 








Ndel 
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n ~. lf > rc ~ crrac'TCdA n-CidcciTTnTTcnn a tttpg 

IVCVCrSG ^LiV^VJVv 1 VJ.TYVJ VJVJV^VJ 1 lVJi 1 ^UVJAi 1 1. V^VJ 


XhoT 


900 


Forward CCjCGCjA 1 LCLA 1 A 1 vj-UCLj 1 C 1 LrAAA I IjUv^Lj 


oamrii- 






IN del 




I7airo.roo OCCdCTCCl AO- AT A TOO A A A AOTCTOTTOTO 




901 


_ j /~i/-</-i/^/~' a Terr* a x a x/ - * rrrr a nTTTPfi a 

Forward C(jC(jvj A 1 LLLA 1 A 1 Cj-C CUvjA I I I 1 I UVjA 1 vj 


"DorvitTT 






iNUCx 




1? pvptsp CCCGCTCG AG- AA AATGG AAC AATACC AGG 


Xhol 




Forward rrGG A ATTCT AC AT ATG-TTGCACTTTCAAAGGATAATC 


EcoRI- 




2 


Ndel 




Reverse CCCGCTCGAG-AAAAATGTACAATGGCGTAC 


Xhol 


903 


Forward CCGGAATTCTAGCTAGC-CAGCGTCAGCAGCACAT 


EcoRI- 






Nhel 




Reverse CCCGCTCGAG-GAAACTGTAATTCAAGTTGAA 


Xhol 


904 


Forward AAAAAAGGTACC-ATGATGCAGCACAATCGTTTC 


Kpn I 




Reverse AAACTGCAG-TTAATATCGATAGGTTATATG 


Pst T 

X 51 A 


904a Forward AAAAAAGAATTC-CGGCTCGGCATTGTGCAGATGTTGCA 


Fro RT 






Pvjt T 
AO I X 


905 


T7««« r «*-^ rrrrn a Trrr a t a a apa a a ATATArrnrATr' 








Ndel 




Reverse CrCGCTCGAG-CCACTGATAACCGACAGAT 

,A^,VV V O VV V_»/ v* V/ VJ V_> J. V—^ J. V V-J Vwi X V_^ X VJ A. X X .i XX \vVJ4 L\JJ X X 


-/VllvJX 




Forward rGCGGATCCCATATG-GGCGCGCAACGTGAG 

J_ vJl VV ul V * V V .1 V 1 VJ J. X X V_^.ZX X ./ L. X \_l VJ VJ VvJ VVJ V_/.£ X^ 1VVJ X VJ ilVJ 


RnmFTT- 






Ndel 




Reverse CCCGCTCGAG-ACGCCACTGCCAGCG 


Xhol 


908 


Forward AAAGAATTC-GCAGAGTTAGTAGGCGTTAATAAAAATAC 


EcoRI 




Reverse AAACTGCAG-TTAATATGGTTTTGTCGTTCG 


PstI 


909 


Forward CGCGGATCCCATATG-TGCGCGTGGGAAACTTAT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TCGGTTTTGAAACTTTGGTTTT 


Xhol 


910 


Forward AAAGAATTC-GCATTTGCCGGCGACTCTGCCGAGCG 


EcoRI 




Reverse AAACTGCAG-TCAGCGATCGAGCTGCTCTTT 


A Ol A 


911 


Forward AAAGAATTC-GCTTTCCGCGTGGCCGGCGGTGC 


Fro RT 




Reverse AAAAAACTGCAG-GTCGACTTATTCGGCGGCTTTTTCCGC 


Pet T 


912 


Forward AAAAAAGAATTC- 


rjrt- t>t 
HCO xvl 




CAAATCCGTCAAAACGCCACTCAAGTATTGAG 






Reverse AAAAAACTGCAG-TTACAGTCCGTCCACGCCTTTCGC 


A Ol A 


913 


Forward CGCGGATCCCATATG-GAAACCCGCCCCGC 


RamFfT- 

JjCUIIXAX 






Ndel 




Reverse CCCGCTCGAG-AGGTTGTGTTCCAGGTTG 


XhoT 

AJ1VJ A 


915 


Forward CGCGGATCCCATATG-TGCCGGCAGGCGGAA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTGAAAATATAGGTATCAGG 


Xhol 


914 


Forward AAAGAATTC-GACAGAATCGGCGATTTGGAAGCACG 


EcoRI 




Reverse AAACTGCAG-CTATATGCGCGGCAGGACGCTCAACGG 


PstI 


916 


Forward CGCGGATCCCATATG-GCAATGATGGCGGCTG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTGGCGGCATCTTTCAT 


Xhol 
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917 Forward AAAAAA GAATTC -CCTGCCGAAAAACCGGCACCGGC Eco RI 

Reverse AAAAAACTGCAG-TTATTTCCCCGCCTTCACATCCTG Pst I 

919 Forward CGC GGATCCCATATG -TGCCAAAGCAAGAGCATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGGGCGGTATTCGGG Xhol 

920 Forward CGC GGATCCCATATG -CACCGCGTCTGGGTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATGGTGCGAATGACCGA Xhol 

921 Forward AAAAAA GAATTC -TTGACGGAAATCCCCGTGAATCC Eco RI 
Reverse AAAAA ACTGC AG -TC ATTTC AAGGGCTGC ATCTTC A T p s t I 

922 Forward. CG CGGATCCGCTAGC -TGTACGGCGATGGAGGC BamHI- 

2 Nhel 
Reverse CCC GCTCGAG -CAATCCCGGGCCGCC Xhol 

923 Forward CGC GGATCCCATATG -TGTTACGCAATATTGTCCC BamHI- 

Nhel 

Reverse CCCG CTCGAG -GGACAAGGCGACGAAG Xhol 

925 Forward CGC GGATCCCATATG -AAACAAATGCTTTTAGCCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCCGTTGCATTTGATTTC Xhol 

926 Forward CGC GGATCCCATATG -TGCGCGCAATTACCTC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TCTCGTGCGCGCCG Xhol 

927 Forward CGC GGATCCCATATG -TGCAGCCCCGCAGC BamHI- 

Ndel 

Reverse CCC GCTCGAG -GTTTTTTGCTGACGTAGT Xhol 
929a Forward AAAAAAGAATTC-CGCGGTTTGCTCAAAACAGGGCTGGG Eco RI 

Reverse AAAAA ATCTAGA -TTAAGAAAGACGGAAACTACTGCC Xbal 
931 Forward AAAAA AGAATTC -GCAACCCATGTTTTGATGGAAAC Eco RI 

Reverse AAAAAACTGCAG-TTACTGCCCGACAACAACGCGACG Pst I 

935 Forward AAAAA AGAATTC - Eco RI 

GCGGATGCGCCCGCGATTTTGGATGACAAGGC 
Reverse AAAAAACTGCAG-TCAAAACCGCCAATCCGCCGACAC Pst I 

936 Forward CGC GGATCCCATATG -GCCGCCGTCGGCGC BamHI- 

Ndel 

Reverse CCC GCTCGAG -GCGTTGGACGTAGTTTTG Xhol 

937 Forward AAAAAAGAATTC-CCGGTTTACATTCAAACCGGCGCAAC EcoRI 
Reverse AAAAAA CTGC AG -TTAAAATGTATGCTGTACGCC A A A p s t I 

939a Forward AAAAAA GAATTC -GGTTCGGC AGCTGTGATG A A AC.C. Eco RI 

Reverse AAAAA ACTGC AG -TTAACGC AAACCTTGGATA A A GTTGGr p s t I 
950 Forward CGC GGATCCCATATG -GCCAACAAACCGGCAAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTAGAACCGCATTTGCC Xhol 
953 Forward CGCGGATCCCATATG-GCCACCTACAAAGTGGAC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTGTTTGGCTGCCTCGAT Xhol 
957 Forward CGCGGATCCCATATG-TTTTGGCTGGGAACGGG BamHI- 
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in oei 




Reverse 




AxlOl 


958 


Forward 


LGLGGA I LLLA rATG-GCLGATGCCGTTGCG 


BamHI- 








ATI _ T 

Ndel 




Reverse 


LjGV^C AALiL, I 1 -LrLrLr 1 L>G 1 1 1 G 1 1 vjL.Lt 1 L. 


TT' JT TT 

Hmdlli 


959 


Forward 


CGCGGATCLCATATG-CACCACGACGGACACG 


BamHI- 








Ndel 




iceverse 


prrnrTrnAn r^Tpr;TPriPr2ppr:P»r:A 


AnOl 


G£1 


rorwara 


pPrfwi a Trrr at a anr* ata a p^pp. a pp, a pp. 

LULUUA 1 ^V^l^/Y 1 /\ 1 O-OV^^/Y^AAOLvjA^LjALLj 


T~i T TT 

iDairirll- 








iNuei 




jveverse 




AilOi 


y IZ 


Forward 


A A A A A APtAA TTP 
AAuAAAALrAA 1 lv^- 


T7__ T>T 

bco Kl 






tth apt a ap Annnnnnn AnrriA a atta a a a ap 






Reverse 


A A A A A A TPT AHA TT A A A A ATA A TP A T A A TPT A P A TTTTP 
AAAAAA 1 L 1 AOA- 1 1 AAAAA 1 AA 1 LA 1 AA I L 1 ALA 1 1 1 1 Lr 


VL„ T 

Aba 1 


973 


Forward 


AAAAAAPAA T' PP A *TPP A A P A A P/~<P A A A A 

AAAAAALrAA 1 1 L-A 1 uuALuuLuLALAACCuAAAAC 


Eco RI 




Reverse 


AAAAAAL 1 GLAG- 1 1 AL 1 1 LALGLGGG 1 LGLLATL AGCGT 


Pst I 


982 


Forward 


CGCGGATCCCATATG-GCAGCAAAAGACGTAC 


BamHI- 








Ndel 




Reverse 


/~^/" > «p/"' , p r r , pp a p pa tp a Tr^r^r^r^r^i^r* a nr/^/^ 
LLCGL 1 LuAu-tA 1 LA 1 GLLGLLL A 1 LC 


Xhol 


983 


Forward 


CGCGGATCCCATATG-TTAGCTGTTGCAACAACAC 


BamHI- 








Ndel 




Reverse 


r^r^(~ y nr^ r vr^n at, p,a a r n r^r^r^ r r a htpt a pp 
LLLLrL 1 LGALr-LrAALLGG 1 AGLL 1 ALu 


Xhol 


987 


Forward 


CGCGGATCCCATATG-CCCCCACTGGAAGAAC 


BamHI - 








Ndel 




Reverse 


c*c*c*nr Ar vc*n a p ta ata a a ppttpt a tpcpp 
LLLuL 1 LLrALr- 1 AA 1 AAALL I 1 L 1 A 1 OLrLiL 


Xnol 


988 


Forward 


y""*1 *~~\ y — ^ s~~y y — ■( A r T> i — 1 A T' A T'/^ T*/'~1T r T , ' , T"» AAA T" 1 1 < A / — ^ y"^ / — ^ A A A A A 

CGCGGATCCCATATG-TCTTTAAATTTACGGGAAAAAG 


BamHI- 








Ndel 




Reverse 


npPp A A PPTT TP A TTTPPPTTTPPPTTTT 
LjLL^L AAVjL 11-1 LiA 111 LrLL 111 LLvj 1111 


TT' JTTT 

rlinalll 


989 


Forward 


CCGGAATTCTACATATG-GTCCACGCATCCGGCTA 


EcoRI- 








Ndel 




Reverse 


r^r^r^r^ r^Tt^n a p tttp a a tttpt a pptpt a r TT , r~^r^ 
LLLuL 1 LvjALt- 111 LrAA 1 1 1 Lr 1 Auu 1 G 1 A 1 1 GL 


Xhol 


990 


Forward. 


CGCGGATCCGCTAGC-TTCAGAGCTCAGCTT 


BamHI- 




2 




Nhel 




Reverse 


ppppptp/7 A r 1 a a a a cir^r^ a tttp a c^c^r^ a 
GLL^GL^ 1 L,GALj- AAAL ALtL^G A 111 GAGLGA 


Xhol 


992 


Forward 


CGCGGATCCCATATG-GACGCGCCCGCCCG 


BamHI- 








Ndel 




Reverse 


r y r*r^cic* r Tr*n a p~: r^r^ a a a thppp a a pp a ttp 
LLCuL 1 LuAu-LL AAA 1 LrLLLAALLA 1 1 L 


Xhol 


993 


Forward 


A r-f-iy~-> y^\ a a rr^y— y~ ^ a A rp/^\ y~^^T> /"I A rr»n y — t AAA n-»y^« A 

CGCGGATCCCATATG-GCAATGCTGATTGAAATCA 


BamHI- 








NdeT 




Reverse 


cccgctcgag-gaacacatcgcgcccg 


Xhol 


996 


Forward 


cgcggatcccatatg-tgcggcagaaaatccgc 


BamHI- 








Ndel 




Reverse 


cccgctcgag-tctaaacccctgttttctc 


Xhol 


997 


Forward 


ccggaattctagctagc-cggcacgccgacgtt 


EcoRI- 








Nhel 




Reverse 


cccgctcgag-gacggcatcgctcagg 


Xhol 
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Underlined sequences indicate restriction recognition sites. 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1>: 

gOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG GTGTCGGCGA ACGAGGTGTC 

51 CGGCAGGGCT TGCGCCCGGA TGGTGCTGGT CATCTGCCAG ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGC TCGGGTACGC ATACTTTACC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTGCG CAGCAAATCG ACAAT CATC A CGTTTTCGGC 

2 01 GCGGTTTTTC GGGTCGGTTT GTAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCTGAAG CGATGTTGAG GAAGAGTTCG GGCGAGAAAC AC AGCGT C C A 

351 CGCGGATTGC CCGGCTTCAT CGGGCAGGTG GGACAATACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 2; ORF 001.ng>: 

gOOl .pep 

1 MLPQGKAARR VSANEVSGRA CA RMVLVICQ TLPKRDTLNG SGTHTLPVWA 

51 ILPRSLRSKS TIITFSARFF GSVCNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEAMLRKSS GEKHSVHADC PASSGRWDNT A* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 3>: 

mOOl, seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGcAssCTT ss . GCTTGGA yGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GATCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCCTCCGCAT CGGGCAGGTG GGACAAGACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 00 1>: 

rnOOl .pep 

1 MLPQGKAARR MSANEVCGXL X AWXVLVICQ TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PSASGRWDKT A* 



The following partial DNA sequence was identified in meningitidis <SEQ ID 5>: 

a001 . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGCAAGGCT TGGGCTTGGA TGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTACG CAGCAAATCG ACAAT CAT C A CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCTTGTGCAT CGGGCAGGTG GGACAAAACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 6; ORF 001. a>: 

aOOl.pep 

1 MLPQGKAARR MSANEVCGKA W AWMVLVICQ TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PCASGRWDKT A* 

mOOl/aOOl 96.2% identity over a 131 aa overlap 



10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
I M It I t I I I I I i M M I I I I I I ! I I I I I I I I I 1 I II i I I I I I M I I 1 I I I I 1 I I I 
aOOl . pep MLPQGKAARRMSANEVCGKAWAWMVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
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10 20 30 40 50 60 

70 80 90 100 110 120 

mOOl .pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 
I I I I I M I I I I I I M I I I I I I 11 I I I II II It t I I I II II II II I t I I I I I M I I I I t I I 
aOOl.pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 

70 80 90 100 110 120 



130 

mO 0 1 . pep PSASGRWDKTAX 
I I H I I I I I I I 
aOOl .pep PCASGRWDKTAX 
130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 001 shows 89.3% identity over a 131 aa overlap with a predicted ORF (ORF OOl.ng) 
from N. gonorrhoeae: 

mOOl/gOOl 

10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
1 I II M I I I I : II M I t 1 I I I M I I I I I I I I i ! M I I I I : I I I I I M I I I I I I I 
gOOl MLPQGKAARRVSANEVSGRACARMVLVICQTLPKRDTLNGSGTHTLPVWAILPRSLRSKS 

10 20 30 40 50 60 



70 80 90 100 110 120 

mOOl.pep TIITFSARFFGSACN SAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 

I I t I I I I I I I II : I I I 1 I II M I M II M 11 M I 1 I M I I M I : M M I i 1 11 M I I M 
gOOl TIITFSARFFGSVCNSAARRSSCPSPKIGAVPFIGSVLMVPSEAMLRKSSGEKHSVHADC 

70 80 90 100 110 120 



130 

mOOl . pep PSASGRWDKTAX 
1 :: I I I I I : I I I 
gOOl PASSGRWDNTAX 
130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 7>: 

g003 . seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCGTATTC GGTCGCGCTG TTTTGGGTCA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGCTTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAGG TAGTCCTGAT GGTATTCCTC GGCGTCGTAG 

251 AAGTTTTTCA GCGGTTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTGC CGGTGTCGTT ACCCTGTTTG 

401 TTGAGGCTGG TCGGATCAAC GACGCGGAAA TAATATTGCA GGATGTCGTC 

451 CAGgCTGagt TTGTCGGCAT CGTaggtcac tTTGACGGTC TCGGCATGAC 

501 CCGTATGGCG GTaggacact tctTCgtanc TcGGGtTTTC CGTGttGCCG 

551 TTGGCgttac cGGATACCGC gtcaACCACG CCGTcgatgc gttggaAATa 

601 ggCTTCCAAg ccccaaaagc agccgccggc gaagtaaatg gtgcccgtgt 

651 tcatgattGC TGa 

This corresponds to the amino acid sequence <SEQ ID 8; ORF 003. ng>: 

g003 .pep 

1 MWFVAEGVF GRAVLGHLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGFG 

51 FARQRFVGFA DVDV AVAVGV FNQWLMVFL GVVEVFQRFV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVG VVH AAAVLRAGVV TLFV EAGRIN DAEIILQDW 
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151 QAEFVGIVGH FDGLGMTRMA VGHFFVRVFR VAVGVTGYRV NHAVDALEIG 
201 FQAPKAAAGE VNGARVHDC 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 9>: 

m003 . seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGsTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGGG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CAGCkTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTtCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGk CGGCGATGAC GGCTTTTTCG kCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC 

4 51 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCg . CCGGC GAGGTAAATG GTGCGCGTGT 

651 TCATGATTTT TGA 

This corresponds to the amino acid sequence <SEQ ID 10; ORF 003>: 

m003.pep Length: 221 

1 MWFVAEGIF GRAVLGNLXL LFGQGAFEFG VTRFFIRCRV EAFALRGGLG 

51 FARQRFVSXA DVD VAVAVGV FNQVVLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGXGDD GFFXGVG WH AAAVLRTGVV ALFV EAGRIN DAEEILQDVV 

151 * AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

201 GFQAPEAAXG EVNGARVHDF * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1>: 

a003 . seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATATCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTTCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

401 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC 

4 51 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCCGCCGGC GAGGTAGATG GTGCGCGTGT 

651 TCATGATTTT TGA 

This corresponds to the amino acid sequence <SEQ ID 12; ORF 003. a>: 

a003 .pep 

1 MWFVAEGIF GRAVLGNLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGLG 

51 FARQRFVGFA DID VAVAVGV FNQVVLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVG VVH AAAVLRTGVV ALFV EAGRIN DAEEILQDVV 

151 *AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

201 GFQAPEAAAG EVDGARVHDF * 



m003/a003 95.9% identity over a 220 aa overlap 



10 20 30 40 50 60 

m003.pep MWFVAEG I FGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLG FARQRFVSXA 

I I M I I I II I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I M I I I I : I 
a003 MWFVAEG I FGRAVLGNLVLLFGQGAFEFGVTRFFIRCRVEAFALRCGLGFARQRFVG FA 

10 20 30 40 50 60 
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.70 80 90 100 110 120 

m003 . pep DVDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGVVH 
I : I I I I ! I I I I I I I 1 I I I I I 1 I I I I I I I M I I I ! I I I I M I I I! If I I M I I I I I I I I 
a003 DIDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGGGDDGFFGGVGVVH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m003.pep AAAVLRTGWALFVEAGRINDAEEILQDVVXAEFVGIVGHFDGFGVARMAVGHVFIARIF 
I I I I I I I I M I I II i I I I I! I I M I 1 I I M I I I I I I II I I I I t I I I I I I II M I I I I I I I 
a003 AAAVLRTGVVALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARIF 

130 140 150 160 170 180 

190 200 210 220 

m003 . pep RVAVGVAG YRVNHAVD ALE I G FQAPE AAXGE VNGARVHD FX 

M I II i I I I I I M I I I I I I I.I M I I I I I 111:11111111 
a 0 0 3 RVAVG VAG YR VNH AV DALE I G FQAPE AAAGEV DGARVH D FX 

190 200 210 220 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 003 shows 88.6% identity over a 219 aa overlap with a predicted ORF (ORF 003 .ng) 
from N. gonorrhoeae: 

m003/g003 

10 20 30 40 50 60 

m003 . pep MVVFVAEGIFGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLGFARQRFVSXA 
I M I I I M : I I I I I I I : I 1 I I M I I I I I I I! M I I I M M I M I I 1:11111111: I 
g003 MWFVAEGVFGRAVLGHLVLLFGQGAFE FGVTRFFIRCRVEAFALRCGFGFARQRFVGFA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m003 . pep DVDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGWH 
I I I I I I I I M I I I I I I I I I I I : I I I I I I : I I I I I I I I M I I I I I I I MINI I I I 1 I I 
g003 DVDVAVAVGVFNQVVLMVFLGWEVFQRFVFNNEGQLVFLLLAFEGGGDDGFFGGVGVVH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m003 . pep AAAVLRTGWALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARIF 
I M I II : I M : I I I I II I I I I I I MINI I I I I I I I I If I I : I : : li I I M I : I : I 
g003 AAAVLRAGWTLFVEAGRINDAEIILQDWQAEFVGIVGHFDGLGMTRMAVGHFFV-RVF 

130 140 150 160 170 180 

190 200 210 220 

m003 . pep RV AVG VAG YRVNHAVDALE I G FQAPE AAXGE VNGARVHD FX 

I I I I I I : I I M I I I I I M I II I I I I : I I M II II II M 
g 0 0 3 RVAVG VTG YRVNHAVDALE I G FQAPKAAAGE VNGARVHDC 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 13>: 

g004 . seq 

1 ATGgtagAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GCGCCCATGC CAACAagtga gccaAAtgtT CGGCGGCAGG GCCTacgatT 

101 TCCGCGCCGA TAAagcggcc gGTGgctTTT tcgGCataca ggcgcaTatg 

151 gCCTTTGTTT ACCAgcatca cgcggctgcg accttgaTTT TTGAACGATA 

201 CTTCGCCgaT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCgga ctggtaaACA CCACGCCAAT 

301 GGTgctgcgg cGCAAACCGC TGCCGATATt cgGgtagcgg ccccgcgtta 

351 ttgcccggca atcttacctt ggtcggcggc ttcatGCAGC AGGGGCagtt 

401 ggttggacgc gtcgcccgca a t a AAGAT AT GCGGAATgct ggtCTGCATg 

4 51 gtCAGCGGAT CGGCAACGGG tacgccgcgc gcgtctttgT CGATATTGAT 

501 GTTTTCCAAA CCGATATtgT CAACGTTCGG ACGGCgACCT ACGGCTGCCA 
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551 ACATATATTC GGCAACAAAT ACGCCTTTTT CGCCATCCTG CTCCCAATGG 

601 ACTtctACAT TGCCGTCTGC GTCGAGTTTG ACCTCGGTTT TAG CAT CC AG 

651 ATGCAGTTTC AATtctTCTC CGAACACGGC TTTCGCCTCG TCTGAAACAA 

7 01 CGGGGTCGGA AATGCCGCCG ATGATTCCGC CCAAACCGAA AATTTCAACT 

7 51 TTCACACCCA AACGGTGCAA TGCCTGA 



This corresponds to the amino acid sequence <SEQ ID 14; ORF 004.ng>: 

g004 .pep 

1 MVERHIQHLR NGHLHLMRPC QQVSQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVYQHHAAA TLIFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAN 

101 GAAAQTAADI RVAAPRYCPA ILPWSAASCS RGSW LDAS PA IKICGMLVCM 

151 VSGSATGTPR ASLSILMFSK PILSTFGRRP TAANIYSATN TPFSPSCSQW 

201 TSTLPSASSL TSVLASRCSF NSSPNTAFAS SETTGSEMPP MIPPKPKIST 

251 FT PKRCNA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 15>: 

m004 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCAGG GCCTACGATT 

101 TCCGCGCCGA TAAAGCGGCC GGTGGCTTTT TCGGCATACA GG C G CAT AT G 

151 GCCTTTGTTC AC C AG CAT C A CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

2 51 TATTTCAGAC CGACAAAGCC GATTTGCGGA CTGGTAAACA CCACGCCGAT 

301 GGTGCTGCGC CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCAGCTTC ATGCAGCAGA GGCAGTTGGT 

4 01 TGGACGCATC GCCTGCGATG AAGATATGCG GAATACTGGT CTGCATGGTC 

4 51 AGCGGGTCGG CAACAGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATATT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCCACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCATC GAGTTTGACC TCGGTTTTAG CATCCAGATG 

651 CAGTTTCAAT TCTTCGCCGA ACACGGCGTT CGCCTCGTCT GAAACGACGG 

7 01 GGTCGGAAAT GCCGCCGATG ATTCCGCCCA AACCGAAAAT TTCAACTTTC 

7 51 ACGCCCAAAC GGTGCAATGC CTGA 



This corresponds to the amino acid sequence <SEQ ID 16; ORF 004>: 

m004 .pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVHQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAD 

101 GAAPQTAADI RVAAALS PAI LPWSAASCSR GSW LDASPAM KICGILVCMV 

151 SGSATGTPRA SFSILIFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLT SVLASRCSFN SSPNTAFASS ETTGSEMPPM IPPKPKISTF 

251 TPKRCNA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 17>: 

a004 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCCGG ACCTACGATT 

101 TCTGCGCCGA TGAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTT AC C AG CATC A CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCGGA CTGGTGAACA CTACGCCGAT 

301 GGTGCTGCGG CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCGGCTTC ATGCAGCAGG GGCAGTTGGT 

4 01 TGGACGCGTC GCCCGCAATA AAGATATGCG GAATACTGGT CTGCATAGTC 

4 51 AGCGGATCGG CAACGGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATGTT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCTACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCGTC GAGTTTGGCC TCGGTTTTAG CAT C C AAAT G 

651 CAGTTTCAAT TCTTCACCGA ACACGGCTTT CGCCTCGTCT GAAACGACGG 

701 GGTCGGAAAT GCCGCCGATG ATGCCACCCA AACCGAAAAT TTCAACTTTC 

7 51 ACGCCCAAAC GGTGCAATGC CTGA 
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This corresponds to the amino acid sequence <SEQ ID 18; ORF 004. a>: 

a004 .pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR TYDFCADEAA GGFFGIQAHM 

51 AFVYQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGEHYAD 

101 GAAAQTAADI RVAAALSPAI LPWSAASCSR GSWL DASPAI KICGILVCIV 

151 SGSATGTPRA SFSILMFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLA SVLASKCSFN SSPNTAFASS ETTGSEMPPM MPPKPKISTF 

251 TPKRCNA* 

m0O4/aoO4 94.9% identity over a 257 aa overlap 

10 20 30 40 50 60 

lt\004 . pep MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 
I M I I I I II I I I I I t I I i I I I I I I I I I I I I : I I I I I : I I I I M I I I I I II I I : I I I i I I 
a004 MVERHIQHLRNGHLHLMCPSQQVRQMFGGRTYDFCADEAAGGFFGIQAHMAFVYQHHAAA 

10 20 30 40 50 60 

70 80 90 100 110 120 

n\004 . pep ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHADGAAPQTAADIRVAAALSPAI 
I I I I I I M I I I I I I II M I I I II I I I I I I I I I I II : I : I I I M I I I I I I II II I I II I I 
a004 AL V FER Y FAD DKFVGLVLRGNLRVFQTDKADLRTGEHYADGAAAQTAAD I RVAAALSPAI 

70 80 90 100 110 120 

130 140 150 160 170 180 

m004 .pep LPWS AASCS RG S WLD AS PAMKICG I LVCMVSGSATGTPRASFS I LI FSKP ILSTFGRRPT 

I I II M I I I I I I M I I I I I : I I I I I I I I : II I I I I I I I II I I I I I : I I I I II I I II I I I I 
a 004 LPWSAASCSRGSWLDAS PAIKICGILVCIVSGSATGTPRASFSILMFSKPILSTFGRRPT 

130 140 150 160 170 180 

190 200 210 220 230 240 

m004.pep AASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPPM 
I I I I I I I I I I I I I I I i II I I I M I I I I I I : II I I I : I I II I I I I II I I I I I I I I I II I I I 
a004 AASIYSATNTPFSPSCSQWTSTLPSASSLASVLASKCSFNSSPNTAFASSETTGSEMPPM 

190 200 210 220 230 240 

250 

m004 . pep IPPKPKISTFTPKRCNAX 
: I II I I I I I I I I I I M I I 
a004 MPPKPKI STFTPKRCNAX 

250 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 004 shows 93.4% identity over a 258 aa overlap with a predicted ORF (ORF 004.ng) 
from N. gonorrhoeae: 

m004/g004 

10 20 30 40 50 60 

m004 . pep MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 
I I I I I I I I I I I I I I I I I I III I I I I II I I I I I I I I I I I 1 I I I I I I I I I I I : I I I I I I 
g004 MVERHIQHLRWGHLHLMRPCQQVSQMFGGRAYDFRADKAAGGFFGIQAHMAFVYQHHAAA 

10 20 30 40 50 60 . 

70 80 90 100 110 119 

m004 . pep ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHADGAAPQTAADIRVAAA-LSPA 
: I : I I I I III I I I I I I I II I I I I I I I I I I I I I I I I I I I I : II I I I I I I I I M I I I 

g004 TLIFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHANGAAAQTAADIRVAAPRYCPA 

70 80 90 100 110 120 



iu004 .pep 



120 130 140 150 160 170 179 

ILPWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILIFSKPILSTFGRRP 
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I I i I I I I I I I I I I N I I ! I 1 : 11 i I : I 1 I I I 1 I I I I I I I I I I : I I I : M I 'M I I I I I I I I 
g 004 ILPWSAASCSRGSWLDASPAIKICGMLVCMVSGSATGTPRASLSILMFSKPILSTFGRRP 

130 140 150 160 170 180 

180 190 200 210 220 230 239 

m004 . pep TAASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPP 
I ! I : I I I I III I I I I I I I I I i I I III I I I I I I I IN ! I I I I I I I I Ml M ! I I M I I I I I 
g004 TAANIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPP 

190 200 210 220 230 240 

240 250 
ro004 .pep MI PPKPKI STFT PKRCNAX 

I I I I I I I I I I 1 II M I I I 
g004 MIPPKPKISTFTPKRCNA 

250 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 19>: 

g005 . seq 

1 ATGGGGATGG ACAATATTGA TATGTTCATG CCTGAACAAG AGGAAATCCA 

51 ATCAATGTGG AAAGAAATTT TACTGAATTA CGGTATTTTC CTGCTCGAAC 

101 TGCTTACCGT GTTCGGCGCA ATTGCGCTGA TTGTGTTGGC TATCGTACAG 

151 AGTAAGAAAC AGTCGGAAAG CGGCAGTGTC GTACTGACAG ATTTTTCGGA 

201 AAATTATAAA AAACAGCGGC AATCGTTTGA AACATTCTTT TTAAGCGAGG 

2 51 AAGAGACAAA AC AT C AG G AA AAAAAAGAAA AGAAAAAGGA AAAGGCGGAA 

301 GCCAAAGCAG AG AAAAAG C G TTTGAAGGAG GGCGGGGAGA AATCTGCCGA 

351 AACGCAAAAA TCCCGCCTTT TTGTGTTGGA TTTTGACGGC GATTTGTATG 

401 CACACGCCGT AGAATCCTTG CGTCATGAGA TTACGGCGGT GCTTTTGATT 

4 51 GCCAAGCCTG AAGATGAGGT TCTGCTCAGA TTGGAAAGTC CGGGCGGCGT 

501 GGTTCACGGT TACGGTTTGG CGGCTTCGCA GCTTAGGCGT TTGCGCGAAC 

551 GCAATATTCC GCTGAccgtc gccgTCGATA AGGTCGCGGC AAGCGgcggc 

601 tatatgatgg cgtgtgtgGC GGATAAAATT GTTTCCGCtc cgtttgcggt 

651 catcggttcg gtgggtgtgg tgGcggaagt gcCGAATATC CAccgCctGT 

7 01 TGAAAAAACA TGATATTGAT GTGGATGTGA TGACGGCGGG CGAATTTAAG 

7 51 CGCACGGTTA CTTTTATGGG TGAAAATACG GAAAAGGG C A AACAGAAATT 
801 CCGGCAGGAA C T GG AG G AAA CGCATCAGTT GTTCAAGCAG TTTGTCAGTG 

8 51 AAAACCGCCC CGGGTTGGAT ATTGAAAAAA TAGCGACGGG CGAGCATTGG 
901 TTCGGCCGGC AGGCGTTGGC GTTGAACTTG ATT G AC GAGA TTTCGACCAG 
951 TGATGATTTG TTGTTGAAAG CGTTTGAAAA CAAACAGGtt aTCGAAGTGA 

1001 AATATCAGGA GAAGCGAAGC CTGATCCAGC GCATTGGTTT GCAGGCGGAA 

1051 GCTTCCGTTG AAAAGTTGTT TGCCAAACTT GTCAACCGGC GAGCGGATGT 

1101 GATGTAG 

This corresponds to the amino acid sequence <SEQ ID 20; ORF 005.ng>: 

g005 . pep 

1 MGMDNIDMFM PEQEEIQSMW KEILLNYG IF LLELLTVFGA IALIV LAIVQ 

51 SKKQSESGSV VLTDFSENYK KQRQSFETFF LSEEETKHQE KKEKKKEKAE 

101 AKAEKKRLKE GGEKSAETQK SRLFVLDFDG DLYAHAVESL RHEITAVLLI 

151 AKPEDEVLLR LESPGGVVHG YGLAASQLRR LRERNIPLTV AVDKVAASGG 

201 YMMACVADK I VSAPFAVIGS VGVVAE VPNI HRLLKKHDID VDVMTAGEFK 

251 RTVTFMGENT EKGKQKFRQE LEETHQLFKQ FVSENRPGLD IEKIATGEHW 

301 FGRQALALNL IDEISTSDDL LLKAFENKQV IEVKYQEKRS LIQRIGLQAE 

351 ASVEKLFAKL VNRRADVM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 21>: 

m005 . seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA C AAG AG G AAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT AC AG AG T AAG 

151 AAACAGTCGG AwAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 CACAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 
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301 GCAGAGAAAA A. CGTTTGAA GGAGGGTGGG GAGAAATCTG CCGAAACGCA 

351 nAAATCACGC CTTTTTGTGT TGGANNNNNN NNNNNNNNNN NNNNNNNNNN 

4 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGCGAGCGG CGGTTATATG 

601 ATGGCGTGTG TGGCGGATAA AATTGCTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTGGTGGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

7 01 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

7 51 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT G GAT ATT GAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGT A 

1101 G 

This corresponds to the amino acid sequence <SEQ ID 22; ORF 005>: 

m005 .pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LIV LAIVQSK 

51 KQSXSGSWL TDFSENYKKQ RQSFEAFFLS GEEAQHQEKE EKKKEKAEAK 

101 AEKXRLKEGG EKSAETXKSR LFVLXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXASGGYM 

201 MACVADK IAS APFAIVGSVG VVAEV PNIHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 

301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 

351 VEKLFAKLVN RRADVM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23>: 

a005 . seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA C AAG AG G AAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT AC AG AGT AAG 

151 AAACAGTCGG AAAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

251 C AAAAC AT C A G G AAAAAG AG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 

301 GCAGAGAAAA AGCGTTTGAA GGAGGGTGGG GAGAAATCTT CCGAAACGCA 

351 AAAATCCCGC CTTTTTGTGT TGGATTTTGA CGGCGATTTG TATGCACACG 

4 01 CCGTAGAATC CTTGCGTCAT GAGATTACGG CGGTGCTTTT GATTGCCAAG 

4 51 CCTGAAGATG AGGTTCTGCT TAGATTGGAA AGTCCGGGCG GCGTGGTTCA 

501 CGGTTACGGT TTGGCGGCTT CGCAGCTTAG GCGTTTGCGC GAACGCAATA 

551 TTCCGCTGAC CGTCGCCGTC GATAAGGTGG CGGCGAGCGG TGGTTATATG 

601 ATGGCGTGTG TGGCGGATAA AATTGTTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTTGTAGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

701 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

751 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGTA 

1101 G 

This corresponds to the amino acid sequence <SEQ ID 24; ORF 005. a>: 

a005.pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LIV LAIVQSK 

51 KQSESGSVVL TDFSENYKKQ RQSFEAFFLS GEEAKHQEKE EKKKEKAEAK 

101 AEKKRLKEGG EKSSETQKSR LFVLDFDGDL YAHAVESLRH EITAVLLIAK 

151 PEDEVLLRLE S PGGWHGYG LAASQLRRLR ERNIPLTVAV DKVAASGGYM 

201 MACVADK IVS APFAIVGSVG VVAEV PNIHR LLKKHDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 
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301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 
351 VEKLFAKLVN RRADVM* 

L oo5/a005 79.2% identity over a 366 aa overlap 

10 20 30 40 50 60 

m005.pep MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSWL 
I I I i I I I I I I M I I I I 1 I I I I I t I I I I I I M I I I I i I I I I 1 I M I I I I I I I M MINI 
a 005 MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSWL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m005.pep TDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKXRLKEGGEKSAETXKSR 
M I I I I I II I I! i I I I M I I I I I I : I I I I ! I I I I I 11 M I! M t I I I M II t : I I Ml 
a 005 TDFSENYKKQRQSFEAFFLSGEEAKHQEKEEKKKEKAEAKAEKKRLKEGGEKSSETQKSR 

70 80 90 100 110 120 

130 140 150 160 170 180 

mOOS.pep LFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I 1 I I : 
a 005 LFVLDFDGDLYAHAVESLRHEITAVLLIAKPEDEVLLRLESPGGWHGYGLAASQLRRLR 

130 140 150 160 170 180 



190 200 210 220 230 240 

m005 . pep XXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGVVAEVPNIHRLLKKHDIDVD 
: I I I M I II I I I I M : I I I I I I II I M I II I I I I I I I I I I I I I I I M 

a 005 ERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAIVGSVGWAEVPNIHRLLKKHDIDVD 

190 200 210 220 230 240 

250 260 270 280 290 300 

m005 . pep VMTAGE FKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 

I | M I I I M M I I II I! I I I M I I M I I 1 I I M I I I I I I I I 1 I I I I M I I I M I I I I I II 
a 005 VMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 

250 260 270 280 290 300 



310 320 330 340 350 360 

m005 . pep RQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKLVN 
M I I I 1 I II I I I I I I f II I I I M I I I I I M M I I 11 If I I I I I I I I I I M I M M I I 1 I I 
a 005 RQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKLVN 

310 320 330 340 350 360 



mOOS.pep RRADVMX 
II I I I I I 

a005 RRADVMX 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 005 shows 77.0% identity over a 366 aa overlap with a predicted ORF (ORF 005.ng) 
from N, gonorrhoeae: 

m005/g005 

10 20 30 40 50 

mOOS.pep MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSV 
I M M I II i I I II I II I I M I 1 M I I I I I 11 I I I 1 I I I I I I I I I I I I I I I M I I I I I 
g 005 MGMDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSV 

10 20 30 40 50 60 

60 70 80 90 100 110 

mOOS.pep VLTDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKKRLKEGGEKSAETXK 

1 I I I I I II I I I I 1 I I I I : II I I I 1 : : I I I I : I I M I I I I I I I 11 I I M I I I I I I 1 II I 
g 0 0 5 VLTDFSEN YKKQRQS FET FFLSEEETKHQEKKEKKKEKAEAKAEKKRLKEGGEKS AETQK 

70 80 90 100 110 120 
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120 130 140 150 160 170 

m005 . pep SRLFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

MINI : 
g005 SRLFVLDFDGDLYAHAVESLRHEITAVLLIAKPEDEVLLRLESPGGVVHGYGLAASQLRR 

130 140 150 160 170 180 



180 190 200 210 220 230 

mOOS.pep XXXXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGVVAEVPNIHRLLKKHDID 

: I I I I I I I I I I I I M : I I M I : : I I I I II I ! 1 M I I I 11 I I I I I I 

gOOS LRERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAVIGSVGVVAEVPNIHRLLKKHDID 

190 200 210 220 230 240 



240 250 260 270 280 290 

m005 . pep VDVMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSEMRPQLDIEEVATGEHW 
I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I :: I I I I II 
g005 VDVMTAGEFKRTVTFMGEMTEKGKQKFRQELEETHQLFKQFVSENRPGLDIEKIATGEHW 

250 260 270 280 290 300 



300 310 320 330 340 350 

m005 . pep FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKL 
I I I I I I I I I I I II M I I I I I II I I I I I M I I i I I I I I I : I I I I I I I I I I I I I I I I I I I I I 
g005 FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKRSLIQRIGLQAEASVEKLFAKL 

310 320 330 340 350 360 



360 

mO 0 5 . pep VNRRADVMX 
I I I I I I I I I 
g005 VNRRADVMX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 25>: 

g006. seq 

1 ATGCTGCTGG TGCTggaatt ttggttCGGc gtGtCGGCGG TGGGCatact 

51 tgCGTTGTTT TTATGGCttt TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC gcgACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGG CAGCT GTACCGCCAT TACGGACTGG TTTCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCG 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACGGCA GCGCGGGGCA TAT T TAT T CG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATG AGTTTGGACG ATGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG G AAC AT C AAA 

4 51 GCCGGAACTT GA 



This corresponds to the amino acid sequence <SEQ ID 26; ORF 006.ng>: 

g006 -pep 

1 MLLVLEFWFG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDKTHFIR 

51 KGDERQLYRH YGLVSRLRVL 1 5NREAFGYL CVGAAMGILF GFAFVM MTLK 

101 GYGSAGHIYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 

151 AGT* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 27>: 

m006 . seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACC GGCGG CAGCT GTACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 
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351 GTTTGCCATG AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 
4 01 ATT TGAAAG A CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 
4 51 GCCGGAACTTGA 

This corresponds to the amino acid sequence <SEQ ID 28; ORF 006>: 

xn00 6 .pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 
51 KGDRRQLYRH YGLLARLRVL ISNREAFGYL CVGTAMGILF GFAFVMM TLK 
101 GYSSAGHVYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 
151 AGT* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 29>: 

a006 . seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAAGAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GGACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

3 01 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 

3 51 GTTTGCCATA AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGAAACG GAACATCAAA 
4 51 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 30; ORF 006.a>: 

a006.pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLKN SLERDNHFIR 
51 KGDERQLDRH YGLLARLRVL ISNREAFGY L CVGTAMGILF GFAFVMM TLK 
101 GYSSAGHVYS VGTYLWMFAI SLDDVPRLVE QYSNLKDIGQ RIEWSKRNIK 
151 AGT* 

moo6/a006 96.7% identity over a 153 aa overlap 

10 20 30 40 50 60 

m00 6.pep MLLVLE FWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDRRQLYRH 

I I I I I I I I i I 11 1 I 1 I I I I 1 11 t 1 I I 1 I I I I I M 1 M I : I! 1 ! 1 I I I I M 1 M : 1 I 1 1 I 
a006 MLLVLE FWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERDNHFIRKGDERQLDRH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m006.pep YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYS VGTYLWMFAM 

I I I I I I I I I I I i 11 I I M I i I I I I I I II I I i I I I I I I I i I I I I I M I I I I I I I t I I I I I : 
a006 YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAI 

70 80 90 100 110 120 

130 140 150 

m006.pep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
I I I I I I ! I I M I I I M I I I I I I I I I : I II I I M I 
a 006 SLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 

130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 006 shows 95.4% identity over a 153 aa overlap with a predicted ORF (ORF 006.ng) 
from N. gonorrhoeae: 

m006/g006 

10 20 30 40 50 60 

m006 . pep MLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDRRQLYRH 
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MINIM f 1 1 I If 11 1 1 1 1 ! I M I f 1 1 I 1 1 1 1 1 1 1 I II I 1 1 1 1 1 1 1 f I I 1 1 = I f 1 1 1 I 

g 0 0 6 MLLVLEFWFGVSAVGI LALFLWLLPRFAAI SENLYFRLNNSLERDNHFI RKGDERQLYRH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m006 . pep YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAM 

M I ^ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 = 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 1 

g 0 0 6 YGLVS RLRVLI SNRE AFG YLCVG AAMG I LFG FAFVMMTLKG YGS AGHI YS VGT YLWMFAM 

70 80 90 100 110 120 

130 140 150 

m006 . pep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

1 1 M M M I M M M M M H M M M M M M 

g 1 1 8 SLDDVPRLVEQYSNLKDIGQRI EWSERNI KAGT 

130 140- 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 31>; 

g006-l . seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCAAA ACCCACCGCA AG CG ATT GAT 

51 TGGCACATTT TCCCCGGTCG GACTGGAAAA CCTTTTGATG CTGGGGTATC 

101 CGGTGTTTGG CGGCTGGGCG ATTAATGCCG TGATTGCGGG GAGGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTATTT TTGATGTGGC TGGTCGGTGC 

201 GGCACGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTTGTGTTG GAACAACGGC AGCGGCAAGT CCCGCATTCA 

301 GCGGTAACTG CACGGGTTGC CCTGTCGCGT GAATTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

401 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACGAGCGG CAGCTGTACC GCCATTACGG ACTGGTTTCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 

651 CGGCGCGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

701 TCAAAGGCTA CGGCAGCGCG GGGCATATTT ATTCGGTCGG CACTTATCTG 

7 51 TGGATGTTTG CCATGAGTTT GGACGATGTG CCGCGATTGG TCGAACAATA 

801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

851 TCAAAGCCGG AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 32; ORF 006-l.ng>: 

g006-l.pep 

1 MWKMLKHIAK THRKRLIGTF SPVGLENLLM LGYPVFGGWA INAVIAGRVW 

51 QALLYALVVF LMWLVGAA RR IADTRTFTRI YTEIAVPWL EQRQRQVPHS 

101 AVTARVALSR EFVSFFEEHL PIA ATSWSI FGACIMLLVL E FWVGVSAVG 

151 ILALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDER QLYRHYGLVS 

201 RLRVLISNRE AFGY LCVGAA MGILFGFAFV MM TLKGYGSA GHIYSVGTYL 

251 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33>: 

m006-l . seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCCAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCTGGTCG GACTGGAAAA CCTTTTGATG CTGGTGTATC 

101 CGGTGTTTGG CGGCCGGGCG ATCAATGCCG TGATTGCGGG GGAGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTGCTT TTGATGTGGC TGGTCGGTGC 

2 01 GGTGCGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

2 51 TCGCCGTGCC GGTCGTGTTG GAACAGCGGC AGCGACAAGT CCCGCATTCG 

301 GCGGTAACTG CGCGGGTTGC CCTGTCGCGT GAGTTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

4 01 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACCGGCGG CAGCTGTACC GCCATTACGG ACTGCTTGCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 



WO 99/57280 



177 



PCT/US99/09346 



651 CGGCACGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

7 01 TCAAAGGCTA CAGCAGCGCG GGGCATGTCT ATTCGGTCGG CACTTATCTG 

7 51 TGGATGTTTG CCATGAGTTT GGACGACGTG CCGCGATTGG TCGAACAATA 

801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

851 TCAAAGCCGG AACTTGA 



This corresponds to the amino acid sequence <SEQ ID 34; ORF 006- 1>: 

m006-l -pep 

1 MWKMLKHIAQ THRKRLIGTF SLVGLENLLM LVYPVFGGRA INAVIAGEVW 

51 Q ALLYALVVL LMWLVGAV RR IADTRTFTRI YTEIAVPVVL EQRQRQVPHS 

101 AVTARVALSR EFVSFFEEHL PI AATSVVSI FGACIMLLVL E FWVGVSAVG 

151 ILALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDRR QLYRHYGLLA 

201 RLRVLISNRE AFGY LCVGTA MGILFGFAFV M MTLKGYSSA GHVYSVGTYL 

251 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 

m006-i/gO06-i 95.5% identity in 288 aa overlap 



10 20 30 40 50 60 

irt006-l,pep MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALWL 
I 1 M I M I I : 1 M t M M I It I I M I I M I I I I I I I I I I I M M : I M I II 1 I II I : 
g006-l MWICMLKHIAKTHRKRLIGTFSPVGLENLLMLGYPVFGGWAINAVIAGRVWQALLYALWF 

10 20 30 40 50 60 



70 80 90 100 110 120 

m006-l.pep LMWLVGAVRRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
I I I I f I I : M I I I I I I I! ! I I I I II I I II I I I I I I J I I I M I I I i I I I I II I I I I I I I I i 
g006-l LMWLVGAARRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 

70 80 90 100 110 120 



130 140 150 160 170 180 

m006-l.pep P I AAT SVVS I FGAC IMLLVLE FWVGVS AVG I LALFLWLL PRFAAI SEN LYFRLNNS LERD 
I I II I II I I I I I I I I I I I I I I I I II I I I I I II I I I I I M I I I I I I M I I I I I I I I I I I I I 
g006-l PIAATSVVSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 

130 140 150 160 170 180 



190 200 210 220 230 240 

m006-l .pep NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
I I I I II I I : I M I I I I M :: I I I I I I I I I I I I I I M 1 1 : I I I I I II I I II I I ! I I II : I I 
gOO 6- 1 NHFIRKGDERQLYRHYGLVSRLRVLI S N RE AFG Y LC VG AAMG I L FG FA FVMMT LKG Y G S A 

190 200 210 220 230 240 



250 260 270 280 289 

m006-l .pep GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
I I : I I II I II I M I i t M I i I M I i II II II I M I I I I II II 1 I I I I i I 
g006-l GHIYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 35>: 

a006-l.seq (partial) 

1 . . AGCCAAAACC ACCGCAAGCG ATTGATTGGC ACATTTTTTC TGGTCGGACT 

51 GGAAAACCTT TTGATGCTGG TGTATCCGGT GTTTGGCGGC TGGGCGATTA 

101 ATGCCGTGAT TGCGGGGCAG GCGTGGCAGG CGTTGCTGTA CGCTTTGGTT 

151 GTGCTTTTGA TGTGGCTGGT CGGTGCGGCG CGGCGGATTG CCGATACGCG 

201 CACGTTTACG CGGATTTATA CCGAAATCGC CGTGCCGGTT GTGTTGGAAC 

251 AGCGGCAGCG GCAAGTCCCG CATTCGGCGG TAACTGCGCG GGTTGCCCTG 

301 TCGCGTGAGT TTGTCAGCTT TTTTGAAGAA CACCTGCCGA TTGCCGCGAC 

351 ATCCGTCGTA TCCATATTCG GCGCGTGCAT CATGCTGCTG GTGCTGGAAT 

4 01 TTTGGGTCGG CGTGTCGGCG GTGGGCATAC TTGCGTTGTT TTTATGGCTT 

4 51 TTGCCACGTT TTGCCGCCAT CAGCGAAAAC CTGTATTTCC GCCTGAAGAA 

501 CAGCTTGGAA CGCGACAACC ACTTTATCCG AAAAGGCGAC GAGCGGCAGC 

551 TGGACCGCCA TTACGGACTG CTTGCGCGCC TGCGTGTGCT GATTTCCAAC 

601 CGCGAAGCCT TCGGCTATCT CTGCGTCGGC ACGGCGATGG GTATTTTGTT 

651 CGGCTTTGCT TTTGTGATGA TGACGCTCAA AGGCTACAGC AGCGCGGGGC 
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701 ATGTCTATTC GGTCGGCACT TATCTGTGGA TGTTTGCCAT AAGTTTGGAC 

751 GACGTGCCGC GATTGGTCGA ACAATATTCC AATTTGAAAG AC AT CGGAC A 

801 AC GG AT AG AG TGGTCGAAAC GGAACATCAA AGCCGGAACT TGA 



This corresponds to the amino acid sequence <SEQ ID 36; ORF 006- La>: 

a006-l.pep (partial) 

. SQNHRKRLIG TFFLVGLENL LMLVYPVFGG WAINAVIAGQ AWQALLYALV 

VLLMWLVGAA RRIADTRTFT RIYTEIAVPV VLEQRQRQVP HSAVTARVAL 

HLPIAATSW SIFGACIMLL VLEFWVGVSA VGILALFLWL 



1 
51 
101 
151 
201 
251 

a006-l/m006-l 

a006-l . pep 



SREFVSFFEE 
LPRFAAISEN 
REAFGYLCVG 



LYFRLKNSLE 
TAMGILFGFA 



RDNHFIRKGD 
FVMMTLKGYS 



ERQLDRHYGL 
SAGHVYSVGT 



LARLRVLISN 
YLWMFAISLD 



DVPRLVEQYS NLKDIGQRIE WSKRNIKAGT * 



m006-l 



95.7% identity in 280 aa overlap 

10 . 20 30 40 50 

SQNHRKRL I GT FFLVGLEN LLMLV Y PV FGG WA IN AV I AGQAWQALL YAL WL 

: I : I I I I II I I I II I I I I I | | | | | || j M I I I I I I I I :: I I I I f | 

MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALWL 
10 20 30 40 50 60 



60 70 80 90 100 110 

a006-l . pep LMWLVGAARRIADTRT FTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSRE FVSFFEEHL 
M I I I I I : II M I M j I I I ! II 1 I I I M I M I i M M I I I I I I I I M I M I I I I I j I I M 
m006-l LMWLVGAVRRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 

70 80 90 100 110 120 

120 130 140 150 160 170 

a006-l.pep PIAATSWSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERD 
I I I M I I I II I I I I I I I I II M I I I I I I I I I I I I I M I I I M M I I I I II I I I : I I I I I I 
m006-l PIAATSWSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 

130 140 150 160 170 180 

180 190 200 210 220 230 

a006-l . pep NHFIRKGDERQLDRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
M I I I I II : I I I M I I I I M I I I I I I I I I I I I I I I I I | I I I I I i I I I I I I I I I I I I I 1 I 
m006-l NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 

190 200 210 220 230 240 



240 250 260 270 280 

a006-l . pep GHVYSVGTYLWMFAISLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 
I I I I I I II I I I I I I : I I M I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I 
m006-l GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 37>: 

g007 . seq 

1 atgaACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGcgC 

51 CGCcGCTTCT GCCGccgaca acAGCatcat gaCaAAAGGG CAAAAAGTGT 

101 ACGAATCcAa ctGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGtTTC CTccgctTTT CCggtcgGac tgtattatga acaAACCGCa 

201 cgTCCtgctg cacagcatgg tcaaaggcAt cgacgggaca ttcaaagtgg 

251 agcggcaaaa cctacgacgg atttatgCcc gcaaccgcca tcagcgATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 38; ORF 007.ng>: 

g007 .pep 

1 MNTTRLFTAF ILCCLCAAAS A ADNSIMTKG QKVYESNCIA CHGKKGEGRG 

51 TAFPPLFRSD CIMNKPHVLL HSMVKGIDGT FKVERQNLRR IYARNRHQRC 

101 GHCRRRHLYH ERL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 39>: 
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m007 . seq 

1 ATGAACACAA CCCGACTGCC 

51 CGCCGCTTCT GCCGCCGACA 

101 ACGAATCCAA CTGCGTCGCC 

151 ACCATGTTTC CGCCGCTCTA 

201 GGTGCTGCTG CACAGCATGG 

2 51 AACGGCAAAA CCTACAACGG 

301 GGACATTGCC GCCGTCGCCA 



GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 

ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

CCGCTCCGAC TTCATCATGA AAAAACCGCA 

TCAAAGGCAT CAACGGTACA ATCAAAGTC . 

ATTCATGCCC GCAACCGCCA TCAGCGATGC 

CTTATATCAT GAACGCCTTT GA 



This corresponds to the amino acid sequence <SEQ ID 40; ORF 007>: 

m007 .pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARNRHQRC 

101 GHCRRRHLYH ERL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 41>: 

a007 . seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTC. 

251 AACGGCAAAA CCTACAACGG ATTCATGCCC GCCACTGCCA TCAGCGATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 42; ORF 007. a>: 

a007 .pep 

1 MNTTRLPTAL VLGCLCAAAS AA DNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARHCHQRC 

101 GHCRRRHLYH ERL* 



m007/a007 97.3% identity over a 1 13 aa overlap 

10 20 30 40 50 60 

m007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I I I M I I I I M I I I : I I I M I I I I I I I I I I I I I M II i I I M I! I I I I I M II I I i I I I I 
a 0 0 7 MNTTRLPTAL VLGCLCAAAS AADN S IMTKGQKVYE SNC VACHGKKGEGRGTMFPPLYRS D 

10 20 30 40 50 60 



70 80 90 100 110 

m007 . pep FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
I M I I I II I I I 1 I I I I II I I I I I I t I I I I I 1 I I I : I I I I I I I I I I I I I I I I I I 
a007 FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARHCHQRCGHCRRRHLYHERLX 

70 80 90 100 110 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 007 shows 86.7% identity over a 1 13 aa overlap with a predicted ORF (ORF 007 .ng) 
from N. gonorrhoeae: 

m007/g007 



10 20 30 40 50 60 

m007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I I I I t I I I I :: I I : I I I I I M I I I I I I I I I I I I I I I I : I I I M I I I I I I I I M I : I I I 
g007 MNTTRLPTAFILCCLCAAASAADNS IMTKGQKVYE SNCIACHGKKGEGRGTAFPPLFRSD 

10 20 30 40 50 60 

70 80 90 100 110 

m007 . pep FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
I I : ! I : I I M 1 I I I I I : I I : i | I | | | : | | : || I M I I I I I I I I I I M I I I I 
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g007 CIMNKPHVLLHSMVKGIDGTFKVERQNLRRIYARNRHQRCGHCRRRHLYHERL 

70 80 90 100 110 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 43>: 

g007 -1 . seq (partial ) 

1 ATGAACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGTTTC CTCCGCTTTT CCGGTCGGAC TATATTATGA ACAAACCGCA 

2 01 CGTCCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 
251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 
301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

3 51 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAGGC AAAAAAAAC . 

This corresponds to the amino acid sequence <SEQ ID 44; ORF 007-l.ng>: 

g007-l .pep (partial) 

1 MNTTRLPTAF ILCCLCAAAS A ADNSIMTKG QKVYESNCIA CHGKKGEGRG 

51 TAFPPLFRSD YIMNKPHVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 

101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKG KKN... 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 45>: 

m007-l . seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 
51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 
201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 
2 51 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 
301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 
351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAAGC AAAAAAAAC T 
401 AA 

This corresponds to the amino acid sequence <SEQ ID 46; ORF 007- 1> 

m007-l .pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKS KKN* 

m007-i / g007-i 91 .7% identity in 133 aa overlap 

10 20 30 40 50 60 

m007-l . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
If I II I I I !:: I I : I I I I I I I I I I I I I I I I I I M I I I : I I I I M I M I I I I I I I : I I I 
g007-l MNTTRLPTAF I LCCLCAAASAADNS I MTKGQKVYESNC I ACHGKKGEGRGTAFPPLFRSD 

10 20 30 40 50 60 



70 80 90 100 110 120 

mOQ7-l.pep FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 
: I I : I I : M I I I I I I I I I M I I I I II I I I I I I I M I I II I I I I I M I I I I I I I I I I I I I I 
g007-l YIMNKPHVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

70 80 90 100 110 120 



130 

m007 - 1 . pep TEKDVKQAKSKKNX 

M M I I I I I : I I I 
g 0 0 7 - 1 TEKDVKQAKGKKN 

130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47>: 

a007-l . seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 
51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
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151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CCACTGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAAAC AAAAAA. . 

This corresponds to the amino acid sequence <SEQ ID 48; ORF 007-1. a>: 

a0 07-1. pep (partial) 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKN KK.. 

moo7-i/aoo7-i 98.5% identity in 132 aa overlap 

10 20 30 40 50 60 

m007-l . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I I M M I M ! It I I : I M ! I I I I I M I I M I I M I I I IE M I M I I I M I I I M I M I 1 f 
a007-l MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 

"70 80 90 100 110 120 

m007-l .pep FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 
I M ! I I I M I I II II I I I I I I II E I II M I II I I I U I I I I I I I I M I U M I I I I I M 1 
a007-l FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

70 80 90 100 110 120 

130 

mO 0 7 - 1 . pep TEKDVKQAKSKKNX 

111111111:1! 
a007-l TEKDVKQAKNKK 

130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 49>: 

g0O8 . seq 

1 ATGAACAACA GACATTTTGC CGTCAtcgCC TTGGGCAGCA AC C T T G AC AA 

51 CCCCGCACAA CAAATacgcg gcgcattaga cgcgctctcg tcccatcctg 

101 acatccggct tgaaCaggtt tcctcactgt aTatgaccgc acctgtcggt 

151 tacgAcaaTC agcccgATTT CATCaatgcc gTCTgcaccg TTTCCACCAC 

201 CtTGGACGGC ATTGcccTGC TTGCCgaACT CAAccgTATC GAAGCCGATT 

251 TCGGACGCGA aCGCAGTTTC CGCAATGCAC C G CG C AC AT T GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGCC TTACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAATACG GAAAGGTTGT CGAATTGTCA 

4 51 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGACA GGTAA 

This corresponds to the amino acid sequence <SEQ ID 50; ORF 008.ng>: 

g0O8 .pep 

1 MNNRHFAVIA LGSNLDNPAQ QIRGALDALS SHPDIRLEQV SSLYMTAPVG 

51 YDNQPDFINA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKYGKWELS 

151 KRLGNQGIRL LPDR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 51>: 

m008 . seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCTCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT TGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 TCTGGACGGC ATTGCCyTGC TTGCCGAACT CAACCGTATC GAGGCTGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GkATTTGGAC 

301 ATTATCGACT TTGACGGCAT * CTCCAGCGAC GACACsCGAC TcACCtTGCC 
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351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATCCGCCCT TTGGCAGAAA 
4 01 TCCTCCCTGA TTTTGTTTTA GGAAAACACG GAAAGGTTGC CGAATTGTCA 
4 51 AAACGGyTGG GCAATCAAGG TATCCGTCTT TTACCGGACA GGTAATT 

This corresponds to the amino acid sequence <SEQ ID 52; ORF 008>: 

m008.pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLXLD 

101 IIDFDGISSD DTRLTLPHPR AHERSFVIRP LAEILPDFVL GKHGKVAELS 

151 KRLGNQGIRL LPDR* 

The following partial DNA sequence was identified in K meningitidis<SEQ ID 53>: 

a008 . seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCCCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT CGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 CTTGGACGGC ATTGCCCTGC TTGCCGAACT CAACCGTATC GAAGCCGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGAC TCACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

401 TCCTCCCTGA TTTTATTTTG GGAAAACACG GAAAGGTTGC CGAATTGTCA 

4 51 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGATA AGTAA 

This corresponds to the amino acid sequence <SEQ ED 54; ORF 008. a>: 

a008 .pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKHGKVAELS 

151 KRLGNQGIRL LPDK* 

m008/a008 97.6% identity over a 164 aa overlap 

10 20 30 40 50 60 

m008 . pep MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVG YDNQPDFVNA 

I I II I It I i I If I I I If I I I I M I M 1 I I i I I I I I I I I I I I I I I I I I M I I I I I II I I I I 
a008 MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m008.pep VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR 
I I I! I I ! I I 1 I I I 1 1 I I I I I t 11 1 1 I i 1 I 1 I I 1 l> M i I I I II I I II 1 I I 11 1 11 1 1 1 
a008 VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR 

70 80 90 100 110 120 

130 140 150 160 

m008 . pep AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 
I I I I I I I I I I I I I I I I I I : I I I I I II I I I I I I I I I I I I I I I I I : I 
a008 AHERSFVIRPLAEILPDFILGKHGKVAELSKRLGNQGIRLLPDKX 

130 140 150 160 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 008 shows 92.7% identity over a 164 aa overlap with a predicted ORF (ORF008.ng) 
from N. gonorrhoeae: 

m008/g008 

10 20 30 40 50 60 

m008.pep MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 
M I I I I f I I I I I I I I : I I I I I : I : I I | : | | | | | I I I I : I : I II I I I I I I I I I I I I I I : I I 
g008 MNNRHFAVIALGSNLDN PAQQIRGALDALSSHPDIRLEQVSSLYMTAPVGYDNQPDFINA 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m008 .pep VCTVSTTLDGIALIJVELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR 
I I I I I I I M I I I I I I I I M I I ! I I I I I I I M I I 1 I I! 11 II M M I I 11 I MINIM 
g008 VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR 

70 80 90 100 110 120 

130 140 150 160 

Itv008 .pep AHERS FVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 

I I I M M I I I M I M M I : M I : M I : I M I M M I M M M I M 
g 008 AHERSFVIRPIoAEILPDFILGKYGKWELSKRLGNQGIRLLPDRX 

130 140 150 160 



The following partial DNA sequence was identified in AT gonorrhoeae <SEQ ID 55>: 

g009. seq 

1 ATGCCCCGCG CTGCCGTAGC CTTTGAGCGT CAT CATC AC A AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GCGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGAA ACCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTcg cGTTCCAAGC 

2 01 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

2 51 AaaaGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 56; ORF 009.ng>: 

g009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARNQSVMAV 
51 QLPLVAFSDK VWAFQAWQ AEIQVFA DGG KTWQKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 57>: 

m009 .seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC CGGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 58; ORF 009>: 

m009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPPVAFSDK VWAFQAWQ AEIQVFA DGG KTWQKP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 009 shows 97.7% identity over a 86 aa overlap with a predicted ORF (ORF 009.ng) 
from TV. gonorrhoeae: 

m009/g009 

10 20 30 40 50 60 

mOO 9 . pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 
I M M M M I M I M I M I M M M I M M M M M M M I I I : M M M M i MINI 
g00 9 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARNQSVMAVQLPLVAFSDK 

10 20 30 40 50 60 

70 80 
m009 . pep WVAFQAWQAEIQVFADGGKTWQKPX 
M M M M I M M M 1 M M M M M I 
g00 9 WVAFQAWQAEIQVFADGGKTWQKPX 

70 80 
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The following partial DNA sequence was identified in K meningitidis <SEQ ID 59>: 

a009 . seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTC 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTCTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 60; ORF 009.a>: 

a009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPLVAFSDK WVAFQAVLQ AEIQVFA DGG KTWQKP* 

m009/a009 97.7% identity over a 86 aa overlap 

10 20 30 40 50 60 

m009.pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 
M I M I I I I I I M I I I I I I I I I t M I I I t I I I M I I I I I I M II I I I II I I I I MUM 
a009 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPLVAFSDK 

10 20 30 40 50 60 

70 80 
mOO 9 . pep VWAFQAWQAE IQVFADGGKTWQKPX 

I M I I I I I : I I I I ! I I I I I I M I I I I I 
aOO 9 WVAFQAVLQAE I QV FADGGKTWQK PX 

70 80 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 61>: 

g01O . seq 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

4 01 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

451 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

701 AAG AT AT GG A ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAacgc 

801 cgacggcgaA cgcTTTATGG AAcgctatgc GCcgACCGta aAagaCTTGG 

851 CTTCTCGCga cgtGGTTTCA CgcgcGatgG CGatggaAAt ctatgaaggt 

901 cgcggctgTG GtaaAAAcaA agaCCacgtC TTACTGAAAA TCGACcAtAt 

951 cggtGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCagtttgc cGGTATCGAT CCGATTAAAG ACCCGATTcc ggttgTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCcg aCCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 

1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 62; ORF 010.ng>: 

g010 .pep 
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1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQN VRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEWVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLWF RPTPR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 63>: 

mOlO.seq (PARTIAL) 

1 . . nTCCAATTAT CCAAATCCGG TCTGAATTGT GCCGTTTTGT CTAAAGTGTT 

51 CCCGACCCGT TCGCATACCG TAGCGGCGCA GGGCGGTATT TCCGCCTCTn 

101 TGGGTAATGT GCAGGAAGAC CGTTGGGACT GGCACATGTA CGATACCGTG 

151 AAAGGTTCCG ACTGGTTGGG CGACCAAGAT GCGATTGAGT TTATGTGCCG 

201 CGCCGCGCCT GAAGCCGTAA TTGAGTTGGA ACACATGGGT ATGCCTTTTG 

251 ACCGTGTGGA AAGCGGTAAA ATTTATCAGC GTCCTTTCGG CGGCCATACT 

301 GCCGAACACG GTAAACGCGC GGTAGAACGC GyCTGTGCGG TTGCCGACCG 

351 TACAGGTCAT GCGATGCTGC ATACTTTGTA CCAACAAAAC GTCCGTGCCA 

4 01 ATACGCAATT CTTTGTGGAA TGGACGGCAC AAGATTTGAT TCGTGATGAA 

4 51 AACGGCGATG TCGTCGGCGT AACCGCCATG GAAATGGAAA CCGGCGAAgT 

501 TTATATTTTC CACGCTAAAG CTGTGATGTT TGCTACCGGC GGCGGCGGTC 

551 GTATTTATGC GTCTTCTACC AATGCCTATA TGAATACCGG CGATGGTTTG 

601 GGTATTTGTG CGCGTGCAGG TATCCCGTTG GAAGACATGG AATTCTGGCA 

651 ATTCCAGCCG ACCGGCGTGG CGGGTGCGGG CGTGTTGATT ACCGAA. . . . 

This corresponds to the amino acid sequence <SEQ ID 64; ORF 01 0>: 

mOlO.pep (PARTIAL) 

1 ..XQLSKSGLNC AVLSKVFPTR SHTVAAQGGI SASXGNVQED RWDWHMYDTV 

51 KGSDWLGDQD AIEFMCRAAP EAVIELEHMG MPFDRVESGK IYQRPFGGHT 

101 AEHGKRAVER XCAVADRTGH AMLHTLYQQN VRANTQFFVE WTAQDLIRDE 

151 NGDVVGVTAM EMETGEVYIF HAKAVMFATG GGGRIYASST NAYMNTGDGL 

201 GICARAGI PL EDMEFWQFQP TGVAG AG VL I TE . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65>: 

a010 . seq 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACAT GGGT A TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

401 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

4 51 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

751 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CAT GAT T AAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 
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1301 ATGCCGGCGA ACTGACCCGC 

1351 GATGGTGAAA ACGTTGATGC 

14 01 ATTGCACGCC GGCGTGTTCC 

14 51 GAGAAGTCAT GGCGATTGCC 

1501 AAGAGCAAAG TGTGGAATAC 

1551 CCTAATTGAA GTGGCGAAAG 

1601 AATCACGCGG TGCGCACGCT 

1651 AACTGGATGA AACATACGCT 

17 01 CAAACCGGTG CACACCAAGC 

17 51 AGCGCGTTTA TTGA 



CAACGTATCG AGCGTTTGGA CAATCAAACT 
ATTGCGCCGC GAACTGCAAC GCTCCGTACA 
G TACT GAT GA GATTCTGAGC AAAGGCGTTC 
GAGCGTGTGA AACGTACCGA AATCAAAGAC 
CGCGCGTATC GAGGCTTTGG AATTGGATAA 
CGACTTTGGT GTCTGCCGAA GCACGTAAAG 
TCAGACGACC ATCCTGAGCG CGATGATGAA 
GT AC CAT T C A GATGCCAATA CCTTGTCCTA 
CTTTGAGCGT GGAATACATC AAACCGGCCA 



This corresponds to the amino acid sequence <SEQ ID 66; ORF 010.a>: 

aOlO.pep 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF 

2 01 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDWS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEWVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLVVF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

4 51 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 

551 NWMKHTLYHS DANTLSYKPV HTKPLSVEYI KPAKRVY* 



moio/aoio 98.7% identity over a 23 1 aa overlap 



10 20 30 

m010 .pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

f I I I I I I M I I I I I I I I II 1 i I I | | 1 | M | | | Ml 
aOlO MGFPVRKFDAVIVGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

10 20 30 40 50 60 



40 50 60 70 80 90 

mOlO.pep QEDRWDWHMY DTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

M I I M I II II I I f I I I I I M I I I I I I I I 1 I I I I M | | | | | | | 1 | | | | | | | | | | | m | || 
aOlO QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

70 80 90 100 110 120 

100 110 120 130 140 150 

mOlO.pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

I M II I I I II I I I I I I I I I I I I | I I I | | | | M I I I I I I I I I | | | | | | | || | | | | | | M | 
aOlO GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO . pep T AMEMETGE V Y I FHAKAVM FATGGGGR I YAS S TN AYMNTG DG LG I C ARAG I P LE DME FWQ 

I I I I M I I M II I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | || || 
aOlO TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 

190 200 210 220 230 240 

220 * 230 
mOlO . pep FQPTGVAGAGVLITE 
I : I I I I I I I I I II M 

aOlO FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEIYEG 

250 260 270 280 290 300 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 010 shows 98.7% identity over a 231 aa overlap with a predicted ORF (ORF OlO.ng) 
from N. gonorrhoeae: 

mOlO .pep/g010 .pep 

10 20 30 

mOlO.pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

I I M t I M I I I I I I I I I I M M I I ] I I I I I I I Ml 
g010 MGFPVRKFDAVIVGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

10 20 30 40 50 60 

40 50 60 70 80 90 

mOlO . pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
I I M I I I I I M I I I I I I I I II I | | M I I I I I I I I I I I M I I II I I I I I | | | I | | | |f | | | 
g010 QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

70 80 90 100 110 120 

100 110 120 130 140 150 

mOlO . pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 

I I I I I I 1 I I I I I I I II I I I I I I I M It I I I I M M I M I I I II I I i M | | | | | | | | | | | 
g010 GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 

130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO . pep TAMEMETGEVYI FHAKAVMFATGGGGRI YASSTNAYMNTGDGLGICARAGI PLEDMEFWQ 

M I I I I I I I I M I I I I I I I I M I I I I I I I I I M I I I I I M I I I I I I I I I M I I I II I I I I 
g010 TAMEMETGEVYI FHAKAVMFATGGGGRI YASSTNAYMNTGDGLGICARAGI PLEDMEFWQ 

190 200 210 220 230 240 

220 230 
mOlO . pep FQPTGVAGAGVLITE 
I : I I I I I I I t I I I I I 

g010 FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEIYEG 

250 260 270 280 290 300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 67>: 

g010-l .seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

4 01 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

4 51 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA AT G C T TAT AT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

7 01 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

7 51 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAACGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTGGTTTCA CGCGCGATGG CGATGGAAAT CTATGAAGGT 

901 CGCGGCTGTG GTAAAAACAA AGACCACGTC TTACTGAAAA T CGACCAT AT 

951 CGGTGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAG AT T T C C A 

1001 TTCAGTTTGC CGGTATCGAT CCGATTAAAG ACCCGATTCC GGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 
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1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 68; ORF 010-l.ng>: 

g010-l .pep 

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GG I SASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI R DENG DWG V TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEVWP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLVVF RPTPR* 

g010-l / P10444 

sp| P10444 |DHSA_ECOLI SUCCINATE DEHYDROGENASE FLAVOPROTEIN SUBUNIT 

gnl | PID| dl015210 (D90711) Succinate dehydrogenase, flavoprotein [Escherichia coli] gi | 1786 942 
(AE000175) succinate dehydrogenase flavoprotein. subunit [Escherichia coli] Length = 588 

Score = 1073 (495-6 bits), Expect = 6.7e-169, Sum P(2) = 6.7e-169 

Identities =* 191/303 (63%) , Positives = 238/303 (78%) 



Query : 


1 


MG F PVRKFDAVI VXXXXXXXXXXXXXSKSGLNCAVLSKVFPTRSHTVAAQGG I S AS LGNV 


60 






M PVR+FDAV++ S+SG CA+LSKVFPTRSHTV+AQGGI+ +LGN 




Sb j Ct : 


1 


MKLPVREFDAWIGAGGAGMRAALQlSQSGQTCAIjLSKVFPTRSHTVSAQGGITVALGNT 


60 


Query -. 


61 


QEDR\TOWHfm)TVKGSDWLGDQDArEFMCRAAPEAVlEIiEHMGMPFDRVESGKIYQRPFG 


120 






ED W+ WHMYDTVKGSD + +GDQDAI E +MC + PEA++ELEHMG+ PF R++ G+IYQRPFG 




Sb j Ct : 


61 


HEDNWEWHMYDTVKGSDYIGDQDAIEYMCKTGPEAILELEHMGLPFSRLDDGRIYQRPFG 


120 


Query -. 


121 


GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 


180 






G + G R A ADRTGHA+LHTLYQQN++ +T F EW A DL+++++G WG 




Sb j Ct : 


121 


GQ S KNFGG EQAARTAAAADRTGHALLHTLYQQNLKNHTT I FS E WYALDLVKNQDGAVVG C 


180 


Query : 


181 


TAMEMETGEVYI FHAKAVMFATGGGGRI YASSTNAYMNTGDGLG ICARAGI PLEDMEFWQ 


240 




TA-f +ETGEV F A+A + ATGG GRIY S+TNA++NTGDG+G+ RAG+P++DME WQ 




Sb j Ct : 


181 


TALCIETGEVVYFKARATVLATGGAGRIYQSTTNAHINTGDGVGMAIRAGVPVQDMEMWQ 


240 


Query : 


241 


FHPTGVAG AG VL ITEG VRGEGG I LLNADGERFMERYAPTVKDLASRDWSRAMAME I YEG 


300 






FHPTG+AGAGVL+TEG RGEGG LLN GERFMERYAP KDLA RDW+R++ +EI EG 




Sbjct t 


241 


FH PTG I AGAG VLVT EG CRG EGG YLLNKHG ERFMER YAPNAKDLAGRD WARS I M I E I REG 


300 


Query : 


301 


RGC 303 








RGC 




Sb j Ct : 


301 


RGC 303 




Score 


= 249 


(115.0 bits). Expect = 6.7e-169, Sum P(2) = 6.7e-169 




Identities = 


= 53/102 (51%) , Positives = 62/102 (60%) 




Query : 


309 


HVLLKI DH I GAEKI MEKLPG I RE I S I QFAGXXXXXXXXXXXXTTHYMMGG I PTNYHGEW 


368 




H LK+DH+G E + +LPGI E+S FA T HYMMGGIPT G+ + 




Sb j ct : 


310 


HAKLKLDHIA3KE VLES RLPG I IiELSRTFAHVD P VKE P I P VI PTCHYMMGG I PTKVTGQ AL 


369 


Query: 


369 VP QGD E Y EV P VKG L Y AAG E CAC A S VHG ANRLGTN S LLDLW F 410 








+V V GL+A GE AC SVHGANRLG NSLLDLWF 




Sb j Ct : 


370 


TVNEKGEDVWPGLFAVGE IACVS VHGANRLGGNSLLDLWF 411 





The following partial DNA sequence was identified in N. meningitidis <SEQ ID 69>: 

m010-l .seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCAGCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAg 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

4 01 CCTGTGCGGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 
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4 51 CAACAAAACG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGTCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

7 51 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTACC ACGGCGAAGT 

1101 TGTCGTTCCG CAAGGTGAAG ATTACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCTTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACCAACTCCC TGTTGGACTT GGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 

1301 ATGCAGGTGA GTTGACCCGC CAACGTATCG AGCGTTTGGA CAACCAAACC 

1351 GATGGTGAAA ACGTTGATGC ATTGCGTCGC GAACTGCAAC GCTCTGTACA 

14 01 ACTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC 

14 51 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC 

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTGATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CG AT G AT G AA 

1651 AACTGGATGA AACATACGCT GTACCATTCA GATATCAATA CCTTGTCCTA 

1701 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

1751 AGCGCGTTTA TTGATGA 



This corresponds to the amino acid sequence <SEQ ID 70; ORF 010-1>: 

m010-l.pep- . 

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGMV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GRTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI R DENG DWG V TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEWVP QGEDYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLWF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

4 51 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 

551 NWMKHTLYHS DINTLSYKPV HTKPLSVEYI KPAKRVY* 

mOlO-1 / g010-l 99.5% identity in 410 aa overlap 



10 20 30 40 50 60 

m010-l . pep MGFPVRKFDAVI VGGGGAGLRAALQLSKS GLNCAVLSKVFPTRS HTVAAQGG ISASLGNV 

I I t I i I I I t I I I I I I I I I I ! I I I I I I I I I I 1 I I ! I I I I I I I I I I I I I I I I I I I II I I I I I 
g01O-l MGE*PVBKFDAVI VGGGGAGLRAALQLS KS GLNCAVLSKVFPTRS H TVAAQGGI S ASLGNV 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOlO - 1 . pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVXELE HMGMPFDRVE SGKI YQRPFG 

I I I f t I I I ( I I I I I I I I I I M I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I ! I I 
g010-l QEDRWDWHMYD TVKG SDW1. GD QD AI EFMCRAAPEAVI ELE HMGMPFDRVE S GKI Y QRPFG 

70 80 90 100 110 120 



130 140 150 160 170 180 

m010-l . pep GHTAEHGKRAVERACAVAJDRTGHAMLHTLYQ^ 

I I I M I I I II I I I ! I 1 I 1 t I I I I I I ) I 1 I 111 I I I ) I I I 1 I I i I I I 1 i I 1 1 I I ) I H i I t 
g010- 1 GHTAEHGKRAVERACAVADRTGHAMLHinLYQQNVRAOTrQFFVEWTAQDI.1 

130 140 150 160 170 180 



190 200 210 220 230 240 

mOlO- 1 . pep TAMEMETGEVY I FHAKAVMFATGGGGRI YAS STNAYMNTGDGLGI CARAGIPLEDMEFWQ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ( ( 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 

gOlO- 1 TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 
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190 



190 200 210 220 230 240 

250 260 270 280 290 300 

mO 1 0 - 1 . pep FHPTGVAC^GVLITEGVRGEGGIIJjNADGEREWERYAPTVKDIASRDVVSRiUdAMEIYEG 
I 1 I I I I I I I I I I I I I I 1 I I M I I I I I I I | | ) | | M | | | | | | | | | | | | | | | | | | | | | | | | | 
g010-l FHPTGVAGAGVLITEGVRGEGGII^ADGERFMERYAPTVKDIA^ 

250 260 270 280 290 300 

310 320 330 340 350 360 

m010-l .pep RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 
I I I I I 1 I ! I I t I I I I t I t I I I I I I I I I I I I I I | I I I I I I I I | | | | | | | I | | | | | | | | | | | 
g010-l RGCGKNKDHVLLKIDHIGAEKIMEKTPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 

310 320 330 340 350 360 

370 380 390 400 410 420 

m010-l . pep TNYHGE WVPQGED YE VP VKGL YAAGE CACASVHGANRLGTNSLLDL WFGKAAGD SMI K 

I I I I I I M 1 t I I I I I I I I I I I I i I I I I I I I I I I I I I 1 I I I ! I [ I I I I I 
gO 10 - 1 TNYHGE VWPQGDE YE VPVKGIj YAAGE CACAS VH GANRIiGTNSLLDLWFRPTPRX 

370 380 390 400 410 

430 440 450 460 470 480 

mO 1 0 - 1 . pep FI KEQSDWKPIjPANAGELTRQRI ERLDNQTDGENVD ALRRE LQRS VQL HAGVFRTDE ILS 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 71>: 

a010-l.seq. . 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

4 01 CCTGTGCNGT TGCCGACCGT AC AG G T CAT G CGATGCTGCA TACTTTGTAC 

4 51 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGG CACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA AT GCCTAT AT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

7 01 AAGAC AT GG A ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

7 51 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 

1301 ATGCCGGCGA ACTGACCCGC CAACGTATCG AGCGTTTGGA CAATCAAACT 

1351 GATGGTGAAA ACGTTGATGC ATTG CGCCGC GAACTGCAAC GCTCCGTACA 

14 01 ATTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC 

14 51 G AG AAGT CAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC 

1501 AAGAGCAAAG TGTGGAATAC CGCGCGT ATC GAGGCTTTGG AATTGGATAA 

1551 CCTAATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA 

1651 AACTGGATGA AACATACGCT GTACCATTCA GATGCCAATA CCTTGTCCTA 

17 01 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

17 51 AGCGCGTTTA TTGA 

This corresponds to the amino acid sequence <SEQ ID 72; ORF 010-l.a>: 

a010-l.pep. . 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 
51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 
101 HMGMPFDRVE SGKI YQRPFG GHTAEHGKRA VE RA CAVA D R TGHAMLHTLY 
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191 



151 
201 
251 
301 
351 
401 
451 
501 
551 



QQNVRANTQF 
ATGGGGRIYA 
VLITEGVRGE 
RGCGKNKDHV 
TTHYMMGGIP 
TNSLLDLVVF 
DGENVDALRR 
KSKVWNTARI 
NWMKHTLYHS 



FVEWTAQDLI 
SSTNAYMNTG 
GGILLNADGE 
LLKIDHIGAE 
TNYHGEWVP 
GKAAGDSMIK 
ELQRSVQLHA 
EALELDNLIE 
DANTLSYKPV 



RDENGDVVGV 
DGLGICARAG 
RFMERYAPTV 
KIMEKLPGIR 
QGDEYEVPVK 
FIKEQSDWKP 
GVFRTDEILS 
VAKATLVSAE 
HTKPLSVEYI 



TAMEMETGEV 
IPLEDMEFWQ 
KDLASRDVVS 
EISIQFAGID 
GLYAAGECAC 
LPANAGELTR 
KGVREVMAIA 
ARKESRGAHA 
KPAKRVY* 



YIFHAKAVMF 
FH PTGVAGAG 
RAMAMEIYEG 
PIKDPIPVVP 
ASVHGANRLG 
QRIERLDNQT 
ERVKRTEIKD 
SDDHPERDDE 



m010-l / a010-l 99.3% identity in 587 aa overlap 



40 



50 



60 



10 20 30 

aOlO - 1 . pep MGFPVRKFDAVIVGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

I I M III Ml Ml II M IMM IIIIMIIIIIIIMIIIIIIIIIMIIIIIIIINI 

aO 1 0 - 1 MGFPVRKFDAVI VGGGGAGLRAALQLSKSGIiNCAVLSKVFPTRSHTVAAQGGI SASLGNV 

10 20 30 40 50 60 

70 80 90 100 110 120 

aOl 0 - 1 . pep QE DRWD WHMYDT VKG SDWLGDQD AI E FMCRAAPE A VI E LEHMGM PFDR VE SG KI YQR P FG 

M MM! Mill! II II M IM I IMMM MM IIMIIMM M M IMM MMMI I 

mOlO - 1 QEDRWDWHMYDT VKG SDWLGDQD AI E FMCRAAP E A VI E L E HMGMP FDR VE SG K I YQR P FG 

70 80 90 100 110 120 



130 140 150 160 170 180 

aOlO-l.pep GHT AEHGKRAVE RAC AVADRTGHAMIjHT L YQQNVRANTQ F FVEWT AQD L I RDENGDVVGV 

I I MMM I MMMI MM MM II I I M MIIMM M MM M I IMMMIMIIM I 

m010-l GHTAEHGK^VERACAVADRTGHAMI^TLYQQ^rVRANTQFFVEWTAQDLIRDENGDVVGV 

130 140 150 160 170 180 

190 200 210 220 230 240 

aOl 0 - 1 . pep TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLG I CARAG IPLEDMEFWQ 

M M M.i I I MM! M I Ml MM IMIIIIMI II I Mill M I M I III Ml M 

mO 1 0 - 1 TAMEMETGEVYI FHAKAVMFATGGGGRI YASSTNAYMNTGDGLGI CARAG I PLEDMEFWQ 

190 200 210 220 230 24 0 

250 260 270 280 290 300 

aO 1 0 - 1 . pep FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAMEIYEG 

I M M I M MMM M I MMM II MMMI Ml M I Mill IM! MM! IMMI I l 

m010-l FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEIYEG 

250 260 270 280 290 300 

310 320 330 340 350 360 

a010-l .pep RG CG KNKDHVLLK I DH I GAE KI MEKLPG I RE I S I QFAG I DP I KD P I P WPTTHYMMGG I P 

I Ml M I! MIIMM III MM I MM IMMIIM MMM MM MMM MMM I 

m010-l RG CGKNKDHVLLKI DH I GAEK I ME KL PG I RE I S I QFAG I D P I KDP I P WPTTHYMMGG I P 

310 320 330 340 350 360 

370 380 390 400 410 420 

aO 1 0 - 1 . pep TNYHGEVWPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWFGKAAGDSMIK 

I I II II II IMMM Mill II IMM II MMMI MMM MINI MM M MM I 

mO 1 0 - 1 TNYHGEVVVPQGEDYEVPVKGLYAAGECACASVHGANRLGTNSLLDLVVFGKAAGDSMI K 

370 380 390 400 410 420 



430 440 450 460 470 480 

a010-l .pep FIKEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEILS 

I I MMIIMIM! MIIIIIMIIIMIIMIIMMIIIII MMMMMIMIMI 

mO 1 0 - 1 F I KEQS DWK P LPANAG E LTRQR I ERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEI LS 

430 440 450 460 470 480 

490 500 510 520 530 540 

a 0 1 0 - 1 . pep KG VREVMAI AERVKRTE I KDKS KVWNTAR I EALELDNL I E VAKATLVSAE ARKESRGAHA 

I I II M MIIMM MMM II I IMM II IMMM MMM MINI! Ill Mil II I 

m010-l KG VREVMA I AERVKRTE I KDKSKVWNTAR I EALEIjDNL IE VAKATLVSAE ARKESRGAHA 

490 500 510 520 530 540 

550 560 570 580 

aO 1 0 - 1 . pep SDDHPERDDENWMKHTLYHSDANTLSYKPVHTKPLSVEYIKPAKRVYX 

I I II II MIIMM MIMM I Ml IMMM IMMIIM] IMM 
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m010- 1 SDDHPERDDEMWMKHTLYHSDINTLSYKPVHTKPLSVEYIKPAKRVYX 

550 560 570 580 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 73>: 

gOll . seq 

1 ATGAAGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAC CCGCCTTACC GAAGATATGA AAACCGCGAT GCGCGCCAAA 

151 GATCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAATGCCG CCGTCAAACA 

201 GTTTGAAGTA GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

251 TCCTGACCAA AATGGTCAAA CAGCGCAAAG ACGGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGACGT 

351 GCTGCACCGC TACCTGCCGC AAATGCTCTC CGCCGGCGAA ATCCGCACCG 

4 01 CCGTCGAAGC AGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

4 51 GGCAAAGTGA TGGTCGTATT GAAAAcccGC CTCGCCGGCA AAGccgATAT 

501 GGGCGAAGTC AACAAAATCT TGAAAAccGt aCTGACCGCC tga 

This corresponds to the amino acid sequence <SEQ ID 74; ORF 01 1 .ng>: 

gOll .pep 

1 MKTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKTRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDGAKIYT 

101 EAGRQDLADK ENAEIDVLHR YLPQMLSAGE IRTAVEAAVA ETGAAGMADM 

151 GKVMWLKTR LAGKADMGEV NKILKTVLTA * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 75>: 

mOll . seq (partial) 

1 AT G AGG AC AC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAT CCGCCTTACC GAAGACATGA AAACCGCGAT GCGCGCCAAA 

151 GACCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAACGCCG CCGTCAAACA 

201 GTTTGAAGTG GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

2 51 TCCTGACCAA AATGGTCAAA CAGCGAAAAG ACAGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGAGGT 

351 ACTGCACCGC TACCTTCCCC AAATGCTTTC CGCCGGCGAA ATCCGTACCG 

4 01 AGGTCGAAGC TGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

4 51 GGTAAAGTCA TGGGGCTGCT GAAAACCCGC CTCGCAGGTA AAGCCGA. 

This corresponds to the amino acid sequence <SEQ ID 76; ORF 01 1>: 

mO 1 1 . pep (partial ) 

1 MRTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKIRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDSAKIYT 

101 EAGRQDLADK ENAEIEVLHR YLPQMLSAGE IRTEVEAAVA ETGAAGMADM 

151 GKVMGLLKTR LAGKA 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 01 1 shows 95.8% identity over a 165 aa overlap with a predicted ORF (ORF 01 l.ng) 
from N. gonorrhoeae: 

mOll/gOll 



mOll . pep 
gOll 



mOll .pep 
gOll 



10 20 30 40 50 60 

MRTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKIRLTEDMKTAMRAKDQVSLGTIRL 

> : I M I I M I I I I I I I I I I I | | | | | | | | | | | I t I I I I I M I I I I II I I M I I I 1 

MKTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKTRLTEDMKTAMRAK DQVSLGTIRL 

10 20 30 40 50 60 

70 80 90 100 110 120 

INAAVKQFEVDERTEADDAKITAILTKMVKQRKDSAKIYTEAGRQDLADKENAEIEVLHR 
1 1 I I I I I I I I M I I I I ! M I I I I I M | I I I M t I : I I M I I I M I M I M M I M : I I I | 
INAAVKQFEVDERTEADDAKITAILTKMVKQRKDGAKIYTEAGRQDLADKENAEIDVLHR 
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70 80 90 100 110 120 

130 140 150 160 

mOll . pep YLPQMLSAGEIRTEVEAAVAETGAAGMADIYIGKVMGLLKTRLAGKA 
1 I II I M I II I I I 1 1 I II I I I It M II 1 I I II i : 1 I I I I I I I I 
gOll YLPQMLSAGEIRTAVEAAVAETGAAGMADMGKVMVVLKTRLAGKADMGEVNKILKTVLTA 

130 140 150 160 170 180 

gOll X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 77>: 

g012 . seq 

1 ATGCTCGCCC GTCGCTATTT TTTCAATATC CAACCCGGGG CGGTTTTCAC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGCCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACa 

201 gGcggTGGAT ATTCGgcact tccgCcacca cacccaccga accgatgacc 

251 gcaaacggaG CGGAAACAAT TTTATCCGCc acacacgcca tcatatagcc 

301 gcCGCTTGCC GCGACCTTAT CGAcggcgac ggTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

4 01 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT 

4 51 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCGCCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CTTTTTTTTC CTGATGTTTT GTCTCTTCCT 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 78; ORF 012.ng>: 

g012 -pep 

1 MLARRYFFNI QPGAVFTDKL LEQLMRFLQF LPEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRHFRHHTHR TDDRKRSGNN FIRHTRHHIA 

101 AACRDLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPAL LQTLFLCFGF 

201 RLFLFLFFFF LMFCLFLA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 79>: 

m012 . seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCCACCA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAGCAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 GCCGCTCGCn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

351 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 01 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 51 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

501 nnnnnnnnnn nnnnnnnnnC AACACAAAAA GGCGTGATTT nTGCGTTTCG 

551 GCAGATTTCT CCCCACCCTC CTTCAAACGT TTTTCcTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTGT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 80; ORF 012>: 

m012 -pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARXXXXXXX xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 XXXXXXXXXX XXXXXXXXXX XXXQHKKA*F XRFGRFLPTL LQTFFLCFGF 

201 RLFLFLFLFF LMLCLFPA* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 81>: 

a012 . seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

2 01 GGCGGTGGAT ATTCGGTACT TCCGCTACAA CACCCACCGA ACCGACAATC 

2 51 GCAAACGGAG CGGAAACAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 ACCGCTCGCC GCCACCTTAT CGACGGCGAC GGTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

401 CCGCCCGGAC TTTCCAATCT AAGCAGAACC TCATCTTCAG GCTTGGCAAT 

4 51 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GAAGATTTCT CCCCACCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 82; ORF 012.a>: 

a012 .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR I FTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 

101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 

201 R LFLFLFLFF LMFCLFPA * 

moi2/aoi2 64.2% identity over a 21 8 aa overlap 

10 20 30 40 50 60 

MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
I I I I I I I I I I I M II If I I I I I II I I I i I I I I III I I I I M I I I I I M I I I I I I I I I I I I 
MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
10 20 30 40 50 60 

70 80 90 100 110 120 

NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 
I I I I II I I I II I I I I : : I II I I I if M I : I I II I II II I I : I I : 
NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
70 80 90 100 110 120 

130 140 150 160 170 180 

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 
: : : : Mill I 

PKLR S RQT VT VNRAART FQ S KQN LI FRLGNQKHRRN LMTQG F YG VC I Q I AVK I QHKKAG F 
130 140 150 160 170 180 

190 200 210 219 

XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I M I I I M I 1 1 : I ! I II II I II II I M I I I : I I I II I 
LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
190 200 210 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from 7V. gonorrhoeae 

ORF 012 shows 58.7% identity over a 21 8 aa overlap with a predicted ORF (ORF 012.ng) 
from N. gonorrhoeae: 

m012/g012 

10 20 30 40 50 60 

m012 . pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
Nil : I : I I I M :: I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I II II 
g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 



m012 .pep 
a012 

m012 .pep 
a012 

m012 .pep 
a012 

m012 .pep 
a012 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m012 . pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 
I I M I M I I I I I : I M I I I I I I : I I ! i I : i M I I I f I I I : I I : 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m012 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 

: Mill I 

g012 PKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 

190 200 210 219 

m012 .pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I I I M I : I I i I : I II I I I I I I I I I I : I M I : I I I II 
g012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 83>: 

m012-l.seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCCACCA CACCCACCGA ACCGACAATC 

251 GCAAACGGAG CGGAAGCAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 GCCGCTCGCC GCCACCTTAT CGACGGCGAC GGTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CyTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

4 01 CCGCCCGGAC TTTCCAATCT GAGCAGAACC T CAT CT T C AG GCTTGGCAAT 

451 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GC AG ATTTCT CCCCACCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 84; ORP 012-1>: 

m012-l.pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARRHLIDGD GQRNIAFAQT XKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 
201 R LFLFLFLFF LMFCLFPA * 

m012-l/g012 91.7% identity in 218 aa overlap 

10 20 30 40 50 60 

MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
fill : I : I II I I :: I I I I I I I I I I I I I I I I ! I I I I I I I I I I M ! I I I I I I I I I I I I 
MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 
10 20 30 40 50 60 

70 80 90 100 110 120 

NIMFFQQAVD I RY FRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 
I I M I I I I I f I I : I I I I I M I I : I II I I : I I I I I I I I I I : M f I I I I I I M I I I I I I I 
NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 
70 80 90 100 110 120 

130 140 150 160 170 180 

XKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQG FYGVCIQIAVKIQHKKAGF 
I II I I I I I I I I I I I I I I I I I I I I I I II I II II I I I I I I I I I I I I I I I I I I M I I I I I I I 
PKLRSRQTVTWHAARTFQSEQNL I FRLGNQKHRRNLMTQG FYGVCIQIAVKIQHKKAGF 



m012-l.pep 
g012 

m012-l.pep 
g012 

m0l2-l .pep 
Q012 
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130 



140 



150 



160 



170 



180 



190 200 210 219 

m0l2-l .pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I I I I I I : I I I I I I I I II I I II I I M : I I I I I I I I I I 
g012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was 

a012-l.seq 

ATGCTCGCCC GTTGCCACTT 
TGACAAACTG CTTGAACAAC 
TTCTGTTTGC CCTTTTCCGT 
AAATTCGCCC GCCGTCATCA 
GGCGGTGGAT ATTCGGTACT 
GCAAACGGAG CGGAAACAAT 
ACCGCTCGCC GCCACCTTAT 
CGCGCAAACG CCTAAGCTGC 
CCGCCCGGAC TTTCCAATCT 
CAAAAGCACC GCCGTAATCT 
ACAAATCGCC GTCAAAATCC 
GAAGATTTCT CCCCACCCTC 
CGCCTTTTCC TTTTTCTTTT 
CGCTTAA 



identified in N. meningitidis <SEQ ID 85>: 



l 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



CCTCAATATC 
TGATGCGTTT 
ATTTTCACCC 
CATCCACATC 
TCCGCTACAA 
TTTATCCGCC 
CGACGGCGAC 
GAAGCCGCCA 
AAGCAGAACC 
CATGACGCAA 
AACACAAAAA 
CTTCAAACGC 
CCTCTTTTTC 



CAATTGAGGG 
CCTCCAGTTC 
ATAAAAGTAA 
AATATCATGT 
CACCCACCGA 
ACACACGCCA 
GGTCAGCGGA 
AACCGTAACC 
TCATCTTCAG 
GGATTCTACG 
GGCGGGATTT 
TTTTTCTCTG 
CTGATGTTTT 



CGGTTCTCGC 
CTGTCGGAAT 
CCGTGCGCTT 
TTTTTCAACA 
ACCGACAATC 
TCATATAACC 
ATATTGCGTT 
GTGAACCACG 
GCTTGGCAAT 
GCGTGTGCAT 
TTGCGTTTCG 
CTTTGGCTTC 
GCCTCTTCCC 



This corresponds to the amino acid sequence <SEQ ID 86; ORF 012-1. a>: 

a012-l . pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 

101 TARRHLIDGD GQRN I AFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 
201 R LFLFLFLFF LMFCLFPA * 

a012-l/m012-l 97.2% identity in 218 aa overlap 

10 20 30 40 50 60 

a012-l . pep M LARCH FLNI QLRAVLADKL LEQLMRFLQFLS EFLFALFRI FTHKSNRALKFARRHH I HI 
I I I II I I 1 i 11 I I 1 M t II I i M II I I II I I I I I I I I I I I I I 1 I II 11 i 1 1 i I II I M I I 
m012-l MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 

70 80 90 100 110 120 

a012-l . pep NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
I M f I I I I I I I I I I f : : I I I I I I I I I M : I I I I I I I I I I I : 1 I M I I I II I I I i I II I II 
m012-l NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

a012-l . pep PKLRSRQTVTVNHAARTFQSKQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
I I I I I I II I I II I I I I I M : I I I I I I I I I II II I I I I I I I I I I I II ! I I I I I II I M I I 
m012~l XKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 

190 200 210 219 

a012-l .pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I I i I I I I 1 I I I I I I 1 I I I ! I H I I i ! M 1 I I I I I I I 1 
m012-l LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 87>: 

g013 . seq 

1 aTgcctttga ccatgctgtg cagcaGGAcg tGCGGTTtgt tcataataca 

51 gtCcgaccGG AAAagcggAG GAAaCGCAGT GCCGCGCCCT TCCCCTTTCT 

101 TGCCGTGGCA GGCGATGCag tTgGATTCGT ACACTTTTTG CCCTTTtGtc 
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151 atgatGCTgt tgtcggCGGC AGAAGCgGCG GcgCAGAGGC AGCACAAGAT 

201 GAAGGCGGTC GGCAGTCGGG TTGTGTtcat tGgcgTTTCC cctaatgttt 

251 tgaaaccttg ttttttgatt Ttgcctttac ggggtgaaaa gtttttTtgg 

301 cccaaatccg gaatttag 

This corresponds to the amino acid sequence <SEQ ID 88; ORF 013.ng: 

g013 .pep 

1 MPLTMLCSRT CGLFIIQSDR KSGGNAVPRP SPFLPWQAMQ LDSYTFCPFV 
51 MMLLSAAEAA AQRQHKMKAV GSR VVFIGVS PNVLKPCFLI LPLRGEKFFW 
101 PKSGI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 89>: 

m013 . seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAAGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTCATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT TTTGCCGAAT 

301 CAGTAG 

This corresponds to the amino acid sequence <SEQ ID 90; ORF 01 3>: 

m013.pep 

1 MPLTMLCSST CGFFMMKSER XSGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQKQPKTRAV GSR VVFIGVS FMFETLLLI L RSGXKIFLPN 
101 Q* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 91>: 

a013 . seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAGGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTAATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT CTTGCCGAAT 

301 CGGTAG 

This corresponds to the amino acid sequence <SEQ ID 92; ORF 013.a>; 

a013 .pep 

1 MPLTMLCSST CGFFMMKSER *SGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQRQPKTRAV GSR VVFIGVS LMFETLLLIL RSG*KIFLPN 
101 R* 

moi3/aOi3 97.0% identity over a 101 aa overlap 

10 20 30 40 50 60 

m013.pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 
I I I I t I I I I I I I II I I I I I I I I f I I t II II I I I I I I I I I I I M M i I I I I i I I I I I I I | I 
a013 MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 

10 20 30 40 50 60 

70 80 90 100 

m0l3.pep AQKQPKTRAVGSRVVFIGVSFMFETLLLILRSGXKIFLPNQX 
t I : I I I 1 1 I I II I I M I I i I : I I I t ] i I 1 I i I I 1 i I I I M : I 
a013 AQRQPKTRAVGSRVVFIGVSLMFETLLLILRSGXKIFLPNRX 

70 80 90 100 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 013 shows 73.3% identity over a 101 aa overlap with a predicted ORF (ORF 013.ng) 
from N. gonorrhoeae: 
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m013/g0l3 



10 20 30 40 50 60 

m013 . pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 
I M I I I I I I I I : I : : : I : I I I I I I M M I I I II I I I II I I I I I I I I I M I I I I I I I 
g013 MPLTMLCSRTCGLFIIQSDRKSGGNAVPRPSPFLPWQAMQLDSYTFCPFVMMLLSAAEAA 

10 20 30 40 50 60 



70 80 90 100 

m013 . pep AQKQPKTRAVGSRVVFIGVSF-MFETLLLILR-SGXKIFLPNQX 

11:1 I : I I II I I I I I I I I : : : : I I I I I : I I : 
g013 AQRQHKMKAVGSRVVFIGVSPNVLKPCFLILPLRGEKFFWPKSGIX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 93>: 

g0l5 . seq 

1 ATGCAGTATC TGATTGTCAA AT AC AGC C AT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTTTTTTCCT ACTTTGGAAA AATCCAGAAA 

101 AGCCCTTGGT CGGCTTTTGG AAAGCACTGC CCCACCTCAA CGAC AC GATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAG ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGCA CAAAAATCCT GCTCCTGTTC GCCTACATCG 

251 CACTGGGCAT GGTAATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCTAT GTGTTGCATC GCCTGCATCG TTTACCTTGC 

3 51 CAAAACCAAA GTCCTGCCAT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 94; ORF 015.ng>: 

g015 .pep 

1 MQYLIVKYSH Q IFVTITILV FNIRFFLLW K NPEKPLVGFW KALPHLNDTM 

51 LLFTGLWLMK ITHFSPFNAP WLGT KILLLF AY I ALGMVMM RAR PRSTKFY 

101 TVYLLAMCCI ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 95>: 

mOlS.seq (partial) 

1 . . AAAATCAGAA AAGCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA 

51 CGACAC CATG CTGCTGTTTA CGGGATTGTG G CTG ATG AAA ATTACCCATT 

101 TCTCCCCGTT CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC 

151 GCCTATATCG CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC 

201 CAAGTTCTAC ACCGTTTACC TGCTCGCCAT GTGTTGCGTC GCCTGCATCG 

251 TTTACCTTGC CAAAACCAAA GTCCTGCCTT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 96; ORF 015: 

mOlS.pep (partial) 

1 . . KIRKALAGFW KALPHLNDTM LLFTGLWLMK ITHFSPFNAP WLGTKILLLL 
51 AY I ALGMMMM RARPRSTKFY TVYLLAMCCV ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 97>: 

a015 . seq 

1 ATGCAGTATC TGATTGTCAA AT AC AG CC AT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTGTTTTCNT ACTTTGGAAA AATCCAGAAA 

101 AGCCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA CGACACCATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC GCCTATATCG 

251 CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCCAT GTGTTGCCTC ACCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCTT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 98; ORF 015.a>: 

a015 . pep 
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1 MQYLIVKYSH Q IFVTITILV FNIRV FXL WK NPEKPLAGFW KALPHLNDTM 
51 LLFTGLWLMK ITHFSPFNAP WLGTK ILLLL AYIALGMMMM RA RPRSTKFY 
101 TVYLLAMCCL TCIVYLAKTK VLPF* 



moi5/aOis 96.7% identity over a 91 aa overlap 



10 20 30 

m015.pep KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH 

I I t M I I I I I I I I I i | | | | [ | | | | | | |f | 
a015 LIVKYSHQIFVTITILVFNIRVFXLWKNPEKPLAGFWKALPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 

40 50 60 70 80 90 

mOlS.pep FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCVACIVYLAKTICVLP 
M I I I I M I I I I I I I II I I 1 I I I I I I M | | 1 | | | M I II M ! I I j I :: M | M | M I I E j 
a015 FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCLTCIVYLAKTKVLP 

"70 80 90 100 110 120 



m015 .pep 



a015 



FX 
I I 

FX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 015 shows 94.5% identity over a 91 aa overlap with a predicted ORF (ORF 015.ng) 
from N. gonorrhoeae: 

m015/g015 

10 20 30 

m015 . pep KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH 

I hllllllllllllllll MINIMI! 

9015 L I VKYSHQ I FVTI TI LVFNI RFFLLWKNPEKPLVGFWKALPHIjNDTMIjLFTGLWLMKI TH 

10 20 30 40 50 60 

40 50 60 70 80 90 

FS PFNAPWLGTKI LLLIAYI ALGMMT^ 

MMMM Mill Ml MINIM: 1 1 1 1 1 II II 1 1 N 1 1 h I I II I II I II 1 1 1 

FSPFNAPWLGTKILLLFAYIALGMVmRARPRSTKFYTVYLLAMCCIACIVYLAKTKVLP 
70 80 90 100 110 120 



m015 .pep 
g015 



m015 .pep 
g015 



FX 

N 

FX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 99>: 

g018 .seq 

1 atGCAGCAGG GGCagttggt tggacgcgtc gcccgcaata AAGATATGCG 

51 GAATgctggt CTGCATggtC AGCGGATCGG CAACGGGtac gccgcgcgcg 

101 tCtttgTCGA TATTGATGTT TTCCAAACCG ATATtgTCAA CGTTCGGACG 

151 GCgACCTACG GCTGCCAACA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT tCtACATTGC CGTCTGCGTC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT tCtTCTCCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 100; ORF 018.ng>: 

g018 .pep 

1 MQQGQLVGRV ARNKDMRNAG LHGQRIGNGY AARVFVDIDV FQTDIVNVRT 

51 ATYGCQHIFG NKY AFFAILL PMDFYIAVCV EFDLGFSIQM QFQFFSEHGF 
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101 RLV* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 101>: 

m018 . seq 

1 ATGCAGCAGA GGCAGTTGGT TGGACGCATC GCCTGCGATG AAGATATGCG 

51 GAATACTGGT CTGCATGGTC AGCGGGTCGG CAACAGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATATT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCCACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCATC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT TCTTCGCCGA ACACGGCGTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 102; ORF 018>: 

m018.pep 

1 MQQRQLVGRI ACDEDMRNTG LHGQRVGNRY AARIFFDIDI FQTDIVNVRT 
51 AAHGCQHIFG NKY AFFAILL PMDFYIAVCI EFDLGFSIQM QFQFFAEHGV 
101 RLV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 103>: 

a018 . seq 

1 ATGCAGCAGG GGCAGTTGGT TGGACGCGTC GCCCGCAATA AAGATATGCG 

51 GAATACTGGT CTGCATAGTC AGCGGATCGG CAACGGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATGTT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCTACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCGTC GAGTTTGGCC 

251 TCGGTTTTAG CATCCAAATG CAGTTTCAAT TCTTCACCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 104; ORF 018.a>: 

a018 . pep 

1 MQQGQLVGRV ARNKDMRNTG LHSQRIGNGY AARIFFDIDV FQTDIVNVRT 
51 AAYGCQHIFG NKY AFFAILL PMDFYIAVCV EFGLGFSIQM QFQFFTEHGF 
101 RLV* 

moi8/aOi8 86.4% identity over a 103 aa overlap 

10 20 30 40 50 60 

m018 . pep MQQRQLVGRI AC DEDMRNTGLHGQRVGNRYAARI FFDI DI FQTDI VNVRTAAHGCQHI FG 

111 t I I I I : I : : I I I I I I I I : I I : I ! M I I I I I II I : I I I I ! I I I I I I I : | | | | | i | 
a018 MQQGQLVGRVARNKDMRNTGLHSQRIGNGYAARIFFDIDVFQTDIVNVRTAAYGCQHIFG 

10 20 30 40 50 60 

70 80 90 100 

m018 . pep NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
I I I I M I II I I I I I I I I I I : I I f I I I I I II I I I I : I I I MM 
a018 NKYAFFAILLPMDFYIAVCVEFGLGFSIQMQFQFFTEHGFRLVX 

70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 018 shows 84.5% identity over a 103 aa overlap with a predicted ORF (ORF 018.ng) 
from N. gonorrhoeae: 

m018/g018 

10 20 30 40 50 60 

m018 . pep MQQRQLVGR I AC DE DMRNTGLHGQRVGNRYAARI FFDI D I FQT D I VN VRTAAHGCQH I FG 

IN IMIMI : : I I M : I II I I I : i I MUM M I : M I I I I M I I I : : II M II I 
901S MQQGQLVGRVARNKDMRNAGLHGQRIGNGYAARVFVDIDVFQTDIVNVRTATYGCQHIFG 

10 20 30 40 50 60 
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70 80 90 100 

m018 . pep NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
M I I I I I I I II I I I I I I I I : I I If I I I I I I I I I M : I I I MM 
g018 NKYAFFAILLPMDFYIAVCVEFDLGFSIQMQFQFFSEHGFRLVX 

70 80 90 100 



The following partial DNA sequence was identified in M gonorrhoeae <SEQ ID 105>: 

g019.seq (partial) 

1 . . ctgctggcgg ccctggtgct tgccgcgtgt tcttcgACAA ACAcacTGCC 
51 AGCCGGCAAG ACCCCGGCAG ACAATATAGA AActgcCgAC CTTTCGGCAA 
101 GCGTTCCCAC ccgcCCTGCC GAACCGGAAG GAAAAACGCT GGCAGATTAC 
151 GGCGGCTACC CGTCCGCACT GGATGCAGTG AAACAGAACA ACGATGCGGC 

201 AGCCGCCGCC TATTTGGAAA Acgcaggaga cagCGcgatg gcGGAAAatg 
251 tccgcaagga gtgGCTGa 

This corresponds to the amino acid sequence <SEQ ID 106; ORF 019.ng>; 

g019.pep (partial) 

1 . . LLAALVIjAAC SSTNTLPAGK TPADNIETAD LSASVPTRPA EPEGKTlxADY 
51 GGYPSALDAV KQNNDAAAAA YLENAGDSAM AENVR KEWL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 07>: 

m019.seq (partial) 

1 ATGTACCTAC CCTCTATGAA GCATTCCCTG CCGCTGCTGG CGGCCCTGGT 

51 GCTTGCCGCG TGTTCTTCGA CAAACACACT GCCAGCCGGC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCCGCCCT 

151 GCCGAACCCG AAAGAAAAAC GCTGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCACGCAG ACAGTGGACG CTGTTTGCAC AGGAATACGC 

351 CAAACTCGAA CCGGCAGGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

4 01 CGAGCCGCAA CGACTATACG CGTGCCGCTG AACTGGTCAA AAATACGGGC 

451 AAACTGCCTT CGGGCTGCAC CAAACTGTTG G AACAGG C AG CCGCATCCGG 

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 

551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

601 TTTGACGGCG GTAGACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 

751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 CGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 

851 ACGCCCGCGC CGCCTTGCGC GCCCGACGTT GGGACGAGCT GGCCTCCGTT 

901 ATCTCGCATA TGCCCGAAAA ACTGCAAAAA AGCCCGACCT GGCTCTACTG 

951 GCTGGCACGC AGCCGCGCCG CAACGGGCAA CACGCAAGAG GCGGAAAAAC 

1001 TTTACAAACA GGCGGCAGCG ACGGGCAGGA ATTTTTATGC GGTGCTGGCA 

1051 GGGGAAGAAT TGGGTCGGAA AAT CGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAAAC AGCGTCCGCC GCATGGCGGA AGACGGTGCA GTCAAACGCG 

1151 CACTGGTACT GTTCCAAAAC AG C CAATCTG CCGGTGATGC AAAAATGCGC 

1201 CGTCAGGCTC AGGCGGAATG GCGTTTTGCC ACACGCGGCT TTGACGAAGA 

1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

13 01 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 

13 51 CGCTATATTT CGCCGTTTAA AGACACGGTA ATCCGCCACG CGCAAAATGT 
1401 TAATGTCGAT CCGG CTTGGG TTTATGGGCT GATTCGTCAG GAAAGCCGCT 

14 51 TCGTTATAGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 
1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 
1551 ACAACTTTAC ACCGCCGACG GG. . . 

This corresponds to the amino acid sequence <SEQ ID 108; ORF 019>: 

m019.pep (partial) 

1 MYLPSMKHSL PLIAALVLAA CSSTNTLPAG KTPADNI ETA DLSASVPTRP 
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51 AEPERKTLAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LFAQEYAKLE PAGRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKS PNAAAL LSEMESGLSL EQRS FAWGVL 

251 GHYQSQNLNV PAALDYYGKV ADRRQLTDDQ IEWYARAALR ARRWDELASV 

3 01 ISHMPEKLQK SPTWLYWLAR SRAATGNTQE AEKLYKQAAA TGRNFYAVLA 

3 51 GEELGRKIDT RNNVPDAGKN SVRRMAEDGA VKRALVLFQN SQSAGDAKMR 

4 01 RQAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
451 RYISPFKDTV IRHAQNVNVD PAWVYGLIRQ ESRFVIGAQS RVGAQGLMQV 
501 MPATAREIAG KIGMDAAQLY TADG . . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 109>: 

a019. seq 

1 ATGTACCCAC CCTCTCTGAA GCATTCCCTG CCGCTGCTGG TGGNCCTGGT 

51 GCTTGCCGCG TGTTCTTNGA CAAACACACT GTCAGCCGAC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCNGCCCT 

151 GCCGAACCCG AANGAAAAAC GTNGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCGCGCAG ACAGTGGACG CTGTNTGCAC ANGAATATGC 

351 NAAACTCGAA CCGGCANGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

4 01 CGAGCCGCAA CGACTATACG CGTGCCGCCG AACTGGTCAA AAATACGGGC 

4 51 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 

551 GCCGCCAAAC C AC AG ACG C A CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

601 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 

751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 NGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 

851 ACGCCCGCGC CGCNNTNNGC NNNCGNNGTT NGNANGANNT GGCNNCCGNN 

901 ANCNCGNNNN TGCNNGANAA ACNNNNNNAN AGNCNNANNT NGNTNNANTG 

951 NNTGGCACGC AGCCGCGCCG CNACGGGCAA CACGCAANAN GCGGANAAAC 

1001 TNTACAAACA GGCGGCAGCA NCGGGCANGA ATTTTTATGC NGTGCTGNCN 

1051 GGGGAAGAGT TGGGGCGCAN AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAANC AGCGTCCTCC GTATGGCGGA AGACGGCGCG ATTAAGCGCG 

1151 CGCTGGTGCT GTTCCGAAAC AGCCGAACCG CCGGCGATGC GAAAATGCGC 

1201 CGTCNGGCTC AGGCGGAATG GCGTTTCGCC ACACGCGGCT TCGATGAAGA 

1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

1301 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 

1351 CGCTACATTT CGNNNNNTNA NGACACGGTA ATCCGCCACG CGCAAAATGT 

1401 TAATGTCGAT CCGGCGTGGG TTTACGGGCT GATTCGTCAG GAAAGCCGCT 

14 51 TCGTTATGGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 

1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 

1551 ACAACTTTAC ACCGCCGACG GCAATATCCG TATGGGGACG TGGTATATGG 

1601 CGGACACCAA ACGCCGCCTG CAAAACAACG AAGTCCTCGC CACCGCAGGC 

1651 TATAACGCCG GTCCCGGCAG GGCGCGCCGA TGGCAGGCGG ACACGCCCCT 

1701 CGAAGGCGCG GTATATGCCG AAACCATCCC GTTTTCCGAA ACGCGCGACT 

17 51 ATGTCAAAAA AGTGATGGCC AATGCCGCCT ACTACGCCTC CCTCTTCGGC 

1801 GCGCCGCACA TCCCGCTCAA ACAGCGTATG GGCATTGTCC CCGCCCGCTG 

1851 A 

This corresponds to the amino acid sequence <SEQ ID 110; ORF 019.a>: 

a019.pep 

1 MYPPSLKHSL PLLVXLVLAA CSXTNT LSAD KTPADNIETA DLSASVPTXP 

51 AEPEXKTXAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LXAXEYAKLE PAXRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKS PNAAAL LSEMESGLSL EQRS FAWGVL 

251 GHYQSQNLNV PAALDYXGKV ADRRQLTDDQ IEWYARAAXX XRXXXXXAXX 

301 XXXXXXKXXX XXXXXXXXAR SRAATGNTQX AXKLYKQAAA XGXN FYAVLX 

351 GEELGRXIDT RNNVPDAGKX SVLRMAEDGA IKRALVLFRN SRTAGDAKMR 

4 01 RXAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
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4 51 RYISXXXDTV IRHAQNVNVD PAWVYGLIRQ ESRFVMGAQS RVGAQGLMQV 

501 MPATAREIAG KIGMDAAQLY TADGNIRMGT WYMADTKRRL QNNEVLATAG 

551 YNAGPGRARR WQADTPLEGA VYAETIPFSE TRDYVKKVMA NAAYYASLFG 

601 APHIPLKQRM GIVPAR* 

moi9/aOi9 88.9% identity over a 524 aa overlap 



10 20 30 40 50 60 

m0l9 . pep MYLPSMKHSLPLLAALVLAACSSTNTLPAGKT PADNIETADLSASVPTRPAE PERKTLAD 

II 11:1111111: I I M I II MM I I I M II I M M II M II I II I I I II II 
a019 MYPPSLKHSLPLLVXLVLAACSXTNTLSADKTPADNIETADLSASVPTXPAEPEXKTXAD 
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mOl 9 . pep YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 
M M I M M I M I M I M I M M I M I I M M M M i M II M i M I M M I M I I M 
a0l9 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLXAXEYAKLE 

70 80 90 100 110 120 



130 140 150 160 170 180 

mOl 9 . pep PAGRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 
M I II I I I II II II I I I II II I M I II I M II II M II I II I II II II II I I II I I I II 
a019 PAXRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 

130 140 150 160 170 180 



190 200 210 220 230 240 

mOl 9 . pep LLAGRQTTDARN LAAALG S PFDGGTQG SRE Y ALLN V I GKE ARKS PN AAALLSEMESGLS L 

I M I M M I M M M 1 I M M I M M M M M M I M M II M M II M It M M M M ( 
a019 LLAGRQTT DARN LAAALGS PFDGGTQG SRE YALLNVIGKEARKSPNAAALLSEMESGLSL 

190 200 210 220 230 240 



250 260 270 280 290 300 

mOl 9 . pep EQRSFAWGVLGHYQSQNLNVPAALDYYGKVADRRQLTDDQIEWYARAALRARRWDELASV 

M II I It II II M I M M M M M I I II M 1 M II II M M M II I) I I 
a019 EQRSFAWGVLGHYQSQNLNVPAALDYXGKVADRRQLTDDQIEWYARAAXXXRXXXXXAXX 

250 260 270 280 290 300 



310 320 330 340 350 360 

m019 . pep ISHMPEKLQKSPTWLYWLARSRAATGNTQEAEKLYKQAAATGRNFYAVLAGEELGRKIDT 
I : II II II II I I I I II I II M I : I I I M I I II I II I Ml 

aO 1 9 XXXXXXKXXXXXXXXXXXARSRAATGNTQXAXKLYKQAAAXGXNFYAVLXGEELGRXIDT 

310 320 330 340 350 360 



370 380 390 400 410 420 

mO 1 9 . pep RNNVPDAGKNS VRRMAEDGAVKRALVLFQNSQSAGDAKMRRQAQAEWRFATRGFDEDKLL 

I II I M I I I It ! I I M I I : M I M I I : I 1 : : M M f I t I I 1 I M M II I f I I 1 M I I 
a019 RNNVPDAGKXSVLRMAEDGAIKRALVLFRNSRTAGDAKMRRXAQAEWRFATRGFDEDKLL 

370 380 390 400 410 420 



430 440 450 460 470 480 

mOl 9 . pep TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYISPFKDTVIRHAQNVNVDPAWVYGLIRQ 
II I t II I I I I I M I I I I II I I I I I M II I I I II I I I II II I I M I II I I I II II I II 
a019 TAAQTAFDHGFYDMAVNSAERTDRKLMYTLRYISXXXDTVIRHAQMVNVDPAWVYGLIRQ 

430 440 450 4 60 470 480 



490 500 510 520 

mOl 9 . pep ESRFVIGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADG 
II II I : I t II I II I I I II M M II I I II I I I I M I I I M I M I I 
a019 ESRFVMGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADGNIRMGTWYMADTKRRL 

490 500 510 520 530 540 



a019 



QN N E V L AT AG YN AG P G RAR RW Q A DT P LE G AV Y AE TIPFSETR D Y VKK VMAN AA Y Y AS L FG 
550 560 570 580 590 600 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 019 shows 95.5% identity over a 89 aa overlap with a predicted ORF (ORF 019.ng) 
from N. gonorrhoeae: 

g019/m019 

10 20 30 40 49 

g019 .pep LLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPEGKTLAD 

M MM II II II II MM I I MM I II MM I I III I I I MM Mill 

m019 MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPERKTLAD 

10 20 30 40 50 60 

50 60 70 80 89 

g 0 1 9 . pep YGG YP S ALDAVKQNNDAAAAAY LENAGDS AMAENVRKEWL 

I M M MM II II MMI MMM MMM M MMMM 

mO 1 9 YGGYPSALDAVKQKNDAAVT^AYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 

70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 1 1>: 

g023 . seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 AATGCAGCGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCCCTG CCTAAAGAAT ATCCGGCATG GCAGGCATTT 

151 TTTAGTCAAG CTTGGGTAAA AGTATTTACC CAAGTGAGCT TTATCGCCGT 

2 01 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 
251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT TGtCtGGCTG 

3 01 GTCGGCTGCC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 023. ng>: 

g023 .pep 

1 MVERKLTGAH YGLRDWVMQR AT AV I ML I YT VALLWLFAL P KEYPAWQAF 

51 FSQAWVKVFT QVSFIAVFLH AWVG I RDLWM DYIKPFGV RL FLQVATIVWL 

101 VGCLVY SVKV IWG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 

m023.seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 GATGCAACGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTTCCCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

2 51 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTTTGGCTG 

3 01 GTCGGCTGTC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 023>: 

m023 . pep 

1 MVERKLTGAH YGLRDWVMQR ATAV I MLIYT VALLWLFSL P KEYSAWQAF 

51 FSQTWVKVFT QVSFIAVFLH AWVGI RDLWM DYIKPFGVR L FLQVATIVWL 

101 VGCLVY SVKV IWG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 5>: 

a023. seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GGGATTGGGC 

51 GATGCAACGT GCGACCGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCTCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 



WO 99/57280 



PCT/US99/09346 



205 



201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATNA 
2 51 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTCTGGCTG 
301 GTCGGCTGCT TGGTGTATTC AATTAAAGTA ATTTGGGGGT AA 



This corresponds to the amino acid sequence <SEQ ID 116; ORF 023. a>: 

a023.pep 

1 MVERKLTGAH YGLRDWAMQR ATA V IMLIYT VALLVVLFAL P KEYSAWQAF 

51 FSQTWVK VFT QVSFIAVFLH AWVGI RDLWM DYXKPFGVR L FLQVATIVWL 

101 VGCLVY SIKV IWG* 

m023/a023 96.5% identity over a 1 13 aa overlap 



10 20 30 40 50 60 

m023 . pep MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLVVLFSLPKEYSAWQAFFSQTWVKVFT 
I I It I I I I M I I I M I : I I III M M I I I I I I I I I i M : I t I t I I I I i I I II I I I I M I I 
a023 MVERKLTGAHYGLRDWAMQRATAVIMLIYTVALLVVLFALPKEYSAWQAFFSQTWVKVFT 

10 20 30 40 50 60 



70 80 90 100 110 

m02 3 . pep QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 
I I I f I I I I I I I I I I I I I II I I I I I I I II I (I I I I I I I I I I I I I II I : I M I N 
a023 QVSFIAVFLHAWVGIRDLWMDYXKPFGVRLFLQVATIVWLVGCLVYSIKVIWGX 

70 80 90 100 110 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 023 shows 97.3% identity over a 1 13 aa overlap with a predicted ORF (ORF 023. ng) 
from N. gonorrhoeae: 

g023/m023 

10 20 30 40 50 60 

g023 . pep MVERKIjTGAHYGLRDWMQRATAVIMLIYTVALLVVLFALPKEYPAWQAFFSQAWVKVFT 

1 1 1 11 1 1 1 M 1 1 1 M 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 M M II • 1 1 1 1 1 II II I II i : 1 1 1 M I 

m02 3 MVERKLTGAHYGLRDWVNQRATAVIMLI YTVALLWLFS LP KEYSAWQAF FSQTWVKVFT 

10 20 30 40 50 60 



70 80 90 100 110 

g023 . pep QVS F I AVFLHAWVG I RDLWMDY I KP FGVRLFLQVATI VWLVGCLVYS VKVI WGX 

MIIIIIIUMIiMIIIMIIIIIUMIIIMIIIIIIIIMMMIIMI 

m023 QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 1 7>: 

g025 . seq 

1 ATGTTGAAAC AAAcgACACT TTTGGCAGCT TGTACCGCCG TTGCCGCTCT 

51 GTTGGGCGGT TGcgCCACCC AACAGCCTGC TccTGTCATT GCAGGCAATT 

101 CAGGTATG C A GAC CGTATCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 

151 ACGCCGTACA ATGCCGCTCC TGCCGCCAac gatgcGCCgT ATGTGCCGCC 

2 01 CGTGCAAact gcgccggttT ATTCGCCTCC TGCTTATGTT CCGCcgtCTG 
251 CACCTGCCGT TTCGGgtaca tatgtTCCTT CTTACGCACC CgtcgACATC 

3 01 aacgCGGCGa cgCataCTAT TGTGCGTGGC GACACgGtgt a c a ACATTTc 

3 51 caaAcgCtac CATATCTCTC AAGACGATTT CCGTGCGTGG AACGGCATGA 

4 01 CCGACAATAC GTTGAGCATC GGTCAGATTG TTAAAGTCAA ACCGGCaggA 
451 TATGCCGCAC CGAAAACCGC AGCCGTAGAA AGCAGGCCCG CCGTACCGGC 
501 TGCCGCGCAA ACCCCTGTGA AACCCGCCGC gcaACCGCCC GTTCAGTCCG 
551 CGCCGCAACC TGCCGCGCCC GCTGCGGAAA ATAAAGCGGT TCCCGCCCCC 
6 01 GCGCCCGCCC CGCAATCTCC TGCCGCTTCG CCTTCCGGCA CGCGTTCGGT 
651 CGGCGGCATT GTTTGGCAGC GTCCGACCCA AGGTAAAGTG GTTGCCGATT 
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701 
751 
801 
851 
901 
951 
1001 

This corresponds to the amino acid sequence <SEQ ID 118; ORF 025.ng>: 

g025 .pep 

1 MLKQTTLLAA CTAVAALLGG CATQQPAPVI AGNSGMQTVS SAPVYNPYGA 

51 TPYNAAPAAN DAPYVPPVQT APVYSPPAYV PPSAPAVSGT YVPSYAPVDI 

101 NAATHTIVRG DTVYNISKRY HISQDDFRAW NGMTDNTLSI GQIVKVKPAG 

151 YAAPKTAAVE SRPAVPAAAQ TPVKPAAQPP VQSAPQPAAP AAENKAVPAP 

2 01 APAPQSPAAS PSGTRSVGGI VWQRPTQGKV VAD FGGGN KG VD I AGNAGQP 

2 51 VLAAADGKW YAGSGLRGYG NLVIIQHNSS FLTAYGHNQK LLVGEGQQVK' 

3 01 RGQQVALMGN TDASRTQLHF EVRQNGKPVN PNSYIAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 19>: 



m025 . seq 


(partial ) 










1 


. . GTGCCGCCGG 


TGCAAAGCGC 


GCCGGTTTAT 


ACGCCTCCTG 


CTTATGTTCC 


51 


GCCGTCTGCA 


CCTGCCGTTT 


CGGGTACATA 


CGTTCCTTCT 


TACGCACCCG 


101 


TCGACATCAA 


CGCGGCGACG 


CATACTATTG 


TGCGCGGCGA 


CACGGTGTAC 


151 


AACATTTCCA 


AACGCTACCA 


TATCTCTCAA 


GACGATTTCC 


GTGCGTGGAA 


201 


CGGCATGACC 


GACAATACGT 


TGAGCATCGG 


TCAGATTGTT 


AAAGTCAAAC 


251 


CGGCAGGATA 


TGCCGCACCG 


AAAGCCGCAG 


CCGTAAAAAG 


CAGGCCCGCC 


301 


GTACCGGCTG 


CCGCGCAACC 


GCC CGTAC AG 


TCCGCACCCG 


TCGACATTAA 


351 


CGCGGCGACG 


CATACTATTG 


TGCGCGGCGA 


CACGGTGTAC 


AACATTTCCA 


401 


AACGCTACCA 


TATCTCTCAA 


GACGATTTCC 


GTGCGTGGAA 


CGGCATGACC 


451 


GACAATATGT 


TGAGCATCGG 


TCAGATTGTT 


AAAGTCAAAC 


CGGCAGGATA 


501 


TGCCGCACCG 


AAAACCG CAG 


CCGTAGAAAG 


CAGGCCCGCC 


GTACCGGCTG 


551 


CCGTGCAAAC 


C C CTGTG AAA 


CCCGCCGCGC 


AACCGCCTGT 


GCAGTCCGCG 


601 


CCGCAACCTG 


CCGCGCCCGC 


TGCGGAAAAT 


AAAGCGGTTC 


CCGCGCCCGC 


651 


CCCGCAATCT 


CCTGCCGCTT 


CGCCTTCCGG 


CACGCGTTCG 


GTCGGCGGCA 


701 


TTGT TTGGCA 


GCGTCCGACG 


CAAGGTAAAG 


TGGTTGCCGA 


TTTCGGCGGC 


751 


AACAACAAGG 


GTGT CGATAT 


TGCCGGTAAT 


GCGGGACAGC 


CCGTTTTGGC 


801 


GGCGG CTG AC 


GGCAAAGTGG 


TTTATGCCGG 


TTCAGGTTTG 


AGGGGATACG 


851 


GAAACTTGGT 


CATCATCCAG 


CATAATTCTT 


CTTTCCTGAC 


CGCATACGGG 


901 


CACAAC C AAA 


AATTGCTGGT 


CGGCGAGGGG 


CAGCAGGTCA 


AACGCGGTCA 


951 


GCAGGTTGCT 


TTGATGGGCA 


ATACCGATGC 


TTCCAGAACG 


CAGCTTCATT 


1001 


TCGAGGTGCG 


TCAAAACGGC 


AAACCGGTTA 


ACCCGAACAG 


CTATATCGCG 


1051 


TTCTGA 











This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 025>: 

m025 .pep (partial) 

1 . .VPPVQSAPVY TPPAYVPPSA PAVSGTYVPS YAP VD I NAAT HT I VRGDTVY 

51 NISKRYHISQ DDFRAWNGMT DNTLSIGQIV KVKPAGYAAP KAAAVKS R PA 

101 VPAAAQPPVQ SAPVDINAAT HT I VRGDTVY NISKRYHISQ DDFRAWNGMT 

151 DKTMLSIGQIV KVKPAGYAAP KTAAVESRPA VPAAVQTPVK PAAQPPVQSA 

201 PQPAAPAAEN KAVPAPAPQS PAASPSGTRS VGGIVWQRPT QGKWADFGG 

251 NNKGVDIAGN AGQPVLAAAD GKWYAGSGL RGYGNLVI I Q HNSSFLTAYG 

3 01 HMQKLLVGEG QQVKRGQQVA LMGNTDASRT QLHFEVRQNG KPVNPNSYIA 

351 F* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 

a025 . seq 

1 ATGTTGACAC CAACAACACT TTAGGTAGCT TGTACCGCCC TTGCCGCTCA 

51 GTTGGGCGGA TGCCCCACCC AACACCCTTC TCCTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTACCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 



TCGGCGGCGG CAACAAGGGT GTCGATATTG CCGGCAATGC CGGACAACCC 
GTTTTGGCGG CGGCTGACGG CAAAGTGGTT TATGCCGGTT CAGGTTTGAG 
GGGATACGGA AACTTGGTCA TCATCCAGCA CAATTCCTCT TTCCTGACCG 
CGTACGGGCA CAACCAAAAA TTGCTGGTCG GCGAAGGTCA GCAGGTCAAA 
CGCGGTCAGC AGGTTGCTTT GATGGGTAAT ACCGATGCTT CCAGAACGCA 
GCTTCATTTC GAGGTGCGTC AAAACGGCAA ACCGGTTAAC CCGAACAGCT 
ATATCGCGTT CTGA 
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151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 



ACGCCGTACA 
GGTGCAAAGC 
CACCTGCCGT 
AACGCGGCGA 
CAAATGCTAC 
CCGACAATAC 
TATGCCGCAC 
TGCCGCGCAA 
CGCATACTAT 
CATATCTCTC 
GTTGAGCATC 
CGAAAGCCGC 
ACCCCTGTGA 
TGCCGCGCCC 
CTCCTGCCGC 
CAGCGTCCGA 
GGGTGTCGAT 
ACGGCAAAGT 
GTCATCATCC 
AAAATTGCTG 
CTTTGATGGG 
CGGCAAAACG 



ATGCCGCTCC 
GCGCCGGTTT 
TTCGGGTACA 
CCCATACTAT 
CATATCTCTC 
GTTGAGCATC 
CGAAAGCCGC 
CCGCTCGTAC 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AACCCGCCGC 
GCTGCGGAAA 
TTCGCCTTCC 
CGCAAGGTAA 
ATTGCAGGAA 
GGTTTATGCA 
AGCATAATTC 
GTCGGCGAAG 
CAATACCGAG 
GCAAACCGGT 



TGCCGCCAAC 
ATANGCCTCC 
TACGTTCCTT 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AGTCCGCACC 
GACACGGTGT 
CCGTGCGTGG 
TTAAAGTCAA 
AGCAGGCCCG 
GCAACCGCCT 
ATAAAGCGGT 
GGCACGCGTT 
AGTGGTTGCC 
ATGCGGGACA 
GGTTCCGGTT 
TTCCTTCCTG 
GCCAGCAGGT 
GCTTCTAGAA 
TAATCCGAAC 



GATGCGCCGT 
TGCTTATGTT 
CTTACGCANC 
GACACCGTGT 
CCGTGCGTGG 
TTAAAGTCAA 
AGCAGGCCCG 
CGTCGACATC 
ACAACATTTC 
AACGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
GTGCAGTCCG 
TCCCGCGCCC 
CGGTCGGCGG 
GATTTCGGCG 
GCCCGTTTTG 
TGAGGGGATA 
ACCG CAT ACG 
CAAACGCGGG 
CGCAGCTTCA 
AGCTATATCG 



This corresponds to the amino acid sequence <SEQ ID 112; ORF 

a025.pep 

1 MLTPTTL*VA CTALAAQLG G CPTQHPSPVI AGNSGMQTVP 

51 TPYNAAPAAN DAPYVPPVQS APVYXPPAYV PPSAPAVSGT 

101 NAATHT I VRG DTVYKISKCY HISQDDFRAW NGMTDNTLSI 

151 YAAPKAAAVK SRPAVPAAAQ PLVQSAPVDI NAATHTIVRG 

201 HISQDDFRAW NGMTDNTLSI GQIVKVKPAG YAAPKAAAVK 

251 TPVKPAAQPP VQSAPQPAAP AAENKAVPAP APQSPAASPS 

301 QRPTQGKWA DFGGNNKGVD IAGNAGQPVL AAADGKWYA 

351 VIIQHNSSFL TAYGHNQKLL VGEGQQVKRG QQVALMGNTE 

401 RQNGKPVNPN SYIAF* 



ATGTGCCGCC 
CCGCCGTCTG 
CGTCGACATC 
ACAAGATTTC 
AACGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
AACGCGGCGA 
CAAACGCTAC 
CCGACAATAC 
TATGCCGCAC 
TGCCGTGCAA 
CGCCGCAACC 
GCCCCGCAAT 
CATTGTTTGG 
GCAACAACAA 
GCGGCGGCTG 
CGGCAATTTG 
GGCACAACCA 
CAGCAGGTCG 
TTTCGAGGTG 
CGTTCTGA 

025. a>: 

SAPVYNPYGA 
YVPSYAXVDI 
GQIVKVKPAG 
DTVYNISKRY 
SRPAVPAAVQ 
GTRSVGGIVW 
GSGLRGYGNL 
ASRTQLHFEV 



m025/a025 97.4% identity over a 351 aa overlap 

10 20 30 

m025.pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

I I I M I I I i I : I I I I I I I t I I I I I I I I I I I 
a025 GMQTVPSAPVYNPYGATPYNAAPAANDAPYVPPVQSAPVYXPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 

40 50 60 70 80 90 

m025 . pep YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 

I I M I I I J I I I I I I I I I i I : I I I I [ I I I I I | | | | | | | | | | | I | | | J | | | | M | | | | | | 
a025 YAXVDINAATHTIVRGDTVYKISKCYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 

100 110 120 130 140 150 

100 110 120 130 140 150 

m025.pep KAAAVKSRPAVPAAAQPPVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 
M t I I I I ! I I I I! I I I I I I I I I I I I I M I I I! I I I I M II I I I II I I I I I I I I r ! f I It 
a025 KAAAVKSRPAVPAAAQPLVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 
160 170 180 190 200 210 

160 170 180 190 200 210 

m025.pep DNMLSIGQIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

II M I M I I I I I I I I I I I M : M I : I I I I I I I M I I I I I I I I M I I I I i I I I I I I I I I I 
a025 DNTLSIGQIVKVKPAGYAAPKAAAVKSRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

220 230 240 250 260 270 

220 230 240 250 260 270 

m025 . pep KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAAAD 
I ■ I I I N I I II I I I ! I I I I I II M I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I 
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a02 5 KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKVVADFGGNNKGVDIAGNAGQPVLAAAD 
280 290 300 310 320 330 

280 290 300 310 320 330 

m025 . pep GKVVYAG SGLRGYGNLV I I QHNS S FLTAYGHNQKLLVGEGQQVKRGQQVALMGNT DASRT 

I M I I I I I I I I I I I I I I I I I I II I I ! I I I I t I I I I I I I I M I I I I I I I I I I | | | | : | | | | 
a025 GKVVYAG S GLRG YGN LV 1 1 QHN S S FLT AYGHNQKLLVGEGQQVKRGQQVALMGNTE AS RT 

340 350 360 370 380 390 

340 350 
m025.pep QLHFEVRQNGKPVNPNSYIAFX 
I I I I I If It I I I I I I I I I I I I I 
a025 QLHFEVRQNGKPVNPNSYIAFX 
400 410 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 025 shows 75.6% identity over a 353 aa overlap with a predicted ORF (ORF 025.ng) 
from N. gonorrhoeae: 

m025/g025 

10 20 30 

m025 . pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

Mllhlllhlllllllllllllllllll 

g025 GMQTVSSAPVYNPYGATPYNAAPAANDAPYVPPVQTAPVYSPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 

40 50 60 70 80 90 

YAPVD I NAATHT I VRGDTVYNI S KRYH I S QDD FRAWNGMTDNTLS I GQ I VKVKPAG YAAP 

J 1 1 r 1 1 1 1 1 1 1 1 1 J r i r j j 1 1 1 1 1 e 1 1 e 1 1 1 1 j 1 1 1 e i e i j 1 1 1 1 1 1 1 1 j i i f t e 1 1 1 f r 

YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
100 110 120 130 140 150 



m02 5 -pep 
9025 



100 110 120 130 140 150 

m025 . pep KAAAVKS RPAVPAAAQPPVQSAPVD I NAATHT I VRGDTVYNI S KRYH I S QDD FRAWNGMT 

I 

g025 K 



160 170 180 190 200 210 

m025 . pep DNMLS I GQ I VKVKPAG YAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

I ill Ml i I I llhi i MM i ! I Ml I I I Ml I I M I ! I 

g025 TAAVESRPAVPAAAQTPVKPAAQPPVQSAPQPAAPAAEN 

160 170 180 190 

220 230 240 250 260 

m025 . pep KAVPAPAP- -QSPAASPSGTRSVGGIVWQRPTQGKWADFGGNWKGVDIAGNAGQPVLAA 

I M I M M M I M M NMM I I III I M | M M M I M IM 1 1 I M i I M ; i ! I I I 

g 0 2 5 KAVP AP AP APQS PAAS PSGTRSVGG I VWQRPTQGKWADFGGGNKGVD I AGNAGQ PVLAA 

200 210 220 230 240 250 

270 280 290 300 310 320 

TO025 . pep ADGKWYAGSGLRGYGNLVI IQHNSS FLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDAS 

I I I I I I I I I I I I i I I I I I I 1 I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | 
g025 ADGKWYAGSGLRGYGNLVI IQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDAS 

260 270 280 290 300 310 



330 340 350 

m02 5 . pep RTQLHFEVRQNGKPVNPNSYIAFX 
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r 1 1 1 1 1 1 f i r i i 1 1 r 1 1 1 1 1 1 1 1 1 

g02 5 RTQLHFEVRQNGKPVNPNSYIAFX 
320 330 

The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 1 13>: 

g031 . seq 

1 ATGGTGTCCC TCCGCTTCAG ATTCGGCAAC CACTTTAAAC GCCGACATTC 

51 TGACAATTTC CTTTTCCGCC AGCCAAATAT CATGCGTATC TTTCGGTTCG 

101 GGCTTGTTGG GCATGGCAAC CTTCAACAGC CGCGCCATCA CAGGAATCGT 

151 CGTTCCCTGA ATCAGCAGCG ACAGCACCAC CACGGCAAAC GCCACATCAA 

201 ACAGCAGGTG CGAATTGGGA ACGCCCATCA CCAGCGGCAT CATCGCCAGC 

251 GAAAT CGGTA CGGCTCCTCG CAAGCCCAAC CAACTGATAT ACGCCTTTTC 

301 ACGCAGGCTG TAATTGAATT TCCACAAACC GCCGAACACT GCCAGCGGAC 

351 GCGCGACCAG CATCAGGAAC GCCGCAATCG CCAAGGCTTC CGCCGCCCTG 

4 01 TCCAACACGC CGGCGGGAGA AACCAGCAGA CCGAGCATGA CGAACAAAGT 

4 51 TGCCTGCGCC AGCCAAGCCA AACCGTCCAT CACACGCAAA ACGTGTTCCG 

501 TcgcACGGTT GCGCTGGTTA CCGACAATGA TGCCGGCAAG GTAAACCGCC 

551 AAAAAGCCGC TGCCGCCTAT GGTATTGGTA AACGCAAACA CAAGCAGCCC 

601 GCCCGACACA ATCATCAGCG CG T AC AG AC C TTCCGtacac acctccaatt 

651 cccaatcaac gtcatagctg tctcccgtgt taaaatgttc ttcacttcag 

701 aatccccccc ttcttcccag cccgaaacct tcatgtgtta naccctgggg 

751 tgccccaacg gatttagtaa cctcccaatg actctgcttg tcgccccctt 

801 cgcccgcttt ctccttccgg gaaaacttgt tgtccccgtc ttacattaa 

This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 031.ng>: 

g031 .pep 

1 MVSLRFRFGN HFKRRHSDNF LFRQPNIMRI FRFGLVGHGN LQQPRHHRNR 

51 RSLNQQRQHH HGKRHIKQQV RIGNAHHQRH HRQRNRYGSS QAQPTDIRLF 

101 TQAVIEFPQT AEHCQRTRDQ HQERRNRQGF RRPVQHAGGR NQQTEHDEQS 

151 CLRQPSQTVH HTQNVFRRTV ALVTDNDAGK VNRQKAAAAY GIGKRKHKQP 

201 ARHNHQRVQT FRTHLQFPIN VIAVSRVKMF FTSESPPSSQ PETFMCXTLG 

251 CPNGFSNLPM TLLVAPFARF LLPGKLV VPV LH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 15>: 

m031 . seq (partial) 

1 ... CGCCTGAAGC ACGGTGTCGG ACTGCATTTC TATTCGGCTA TACGCCTTTT 

51 CACGCAGGCT GTAATTGAAT TTCCACAAAC CGCCGAACAC TGCCGACGGA 

101 CGCGCGACCA GCATCAGGAA CGCCGCAATC GCCAAgGCTT CCGCCGCCCT 

151 GTCCAACACG TTGGCAGGAG AAACCAGCAG CAAAGGCATT CCCAAACGTG 

201 CGGACAAAGT GGTCGAAACC ACGCTCAGAA ACAACAGTGC GCCACCCGGC 

251 AG 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 03 1>: 

m031.pep (partial) 

1 . . . RLKHGVGLHF YSAIRLFTQA VIEFPQTAEH CRRTRDQHQE RRNRQGFRRP 
51 VQHVGRRNQQ QRHSQTCGQS GRNHAQKQQC ATRQ. . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

a031 . seq 

1 ATACGCCTTT TCACGCAGGC TGTAATTGAA TTTCCACAAA CCGCCGAACA 

51 CTGCCGGCGG ACGCGCGACC AGC AT CAGG A ACGCCGCAAT CGCCAAGGCT 

101 TCCGCCGCCC CGTCCAACAC GTTGGCAGGA GAAACCAGCA GCAAAGGCAT 

151 TCCCAAACGT GCGGACAAAG TGGTCGAAAC CACGCTCAGA AACAACAGTG 

201 CGCCACCCGG CAG 

This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 031.a>: 

a031.pep (partial) 

. 1 IRLFTQAVIE FPQTAEHCRR TRDQHQERRN RQGFRRPVQH VGRRNQQQRH 
51 SQTCGQSGRN HAQKQQCATR Q 
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m03i/a03i 100.0% identity over a 71 aa overlap 

10 20 30 40 50 60 

m031 . pep RLKHGVGLHFYSAIRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ 

I 1 I I I I I I I I I II It I I I I I I II I I M I I I I I I I I I I II I II I I I I I 
a031 IRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ 

10 20 30 40 

70 80 
m031 - pep QRHSQTCGQSGRNHAQKQQCATRQ 
II I I I I I I I I I I II I I I I I I I I I I 
a0 3 1 QRHSQTCGQSGRNHAQKQQCATRQ 
50 60 70 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 03 1 shows 60.0% identity over a 85 aa overlap with a predicted ORF (ORF 03 1 .ng) 
from N. gonorrhoeae: 

m031/g031 

10 20 30 

m031 .pep RLKHGVGLHFYSAIRLFTQAVIEFPQTAEH 

I : : I : : I II I I I I I I I I I I I I I I 

q031 NQQRQHHHGKRHIKQQVRIGNAHHQRHHRQRNRYGSSQAQPTDIRLFTQAVIEFPQTAEH 
60 70 80 90 100 110 

40 50 60 70 80 

m031 . pep CRRTRDQHQERRNRQGFRRPVQHVGRRNQQQRHS-QTCGQSGRNHAQKQQCATRQ 
I : I I I I I I I I I I I II I I I I I I I I : I I I I I : I : I : I : : : : : : I : I : 
g031 CQRTRDQHQERRNRQGFRRPVQHAGGRNQQTEHDEQSCLRQPSQTVHHTQNVFRRTVALV 
120 130 140 150 160 170 

g031 TDNDAGKVNRQKAAAAYGIGKRKHKQPARHNHQRVQTFRTHLQFPINVIAVSRVKMFFTS 
180 190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 19>: 

g032 . seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTA TTGCGCCGCC CACGATTCGA 

51 GGCGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAAGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGACGC TGCTTGCGCC 

201 CTTTGCCGGT AACGTGTACC CACGCTTCGT C C AAAT AT AC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGCTC 

301 GAACAGCGCG TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAACAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGCGCATCAG 

4 51 CCCGCGTTTG ACCAGCCAGG CGC AATACTG CCGCCACGCC GACAGCTTGC 

501 GCGCCAGCGT CCGACCGTCC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GccgTAAAAT CGCGCCGCGA CAAGTCCTGC GGCACGCcgc ctgcaTCTTC 

601 AG AC G G C AT T TGTGCCAACA GTGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCAAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

7 01 TAAGCGTCAA AATacgccgC AAACccgTCC AAAACCATAA CCGTCCCACA 

7 51 CAAATATCAA AAAACCAGTG A 

This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 032.ng>: 

g032 .pep 

1 MRRNVPAVAV LRRPRFEAFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NLTLLAPFAG NVYPR FVQIY IICIQAVYLA HA QTAAVHQL 



WO 99/57280 



PCT/US99/09346 



211 



101 EQRVVAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGAHQ 

151 PAFDQPGAIL PPRRQLARQR PTVQTALRQP PQRRRKIAPR QVLRHAAC I F 

201 RRHLCQQCKQ FFQIAPVCRN RVLRLALAHD VFQISVKIRR KPVQNHNRPT 

251 QISKNQ* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 

m032 . seq (partial ) 

1 ATGCGGCGAA ACGTGCmTGC mGTCGCCGTT kTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGACCAGCGG CACCTGCCGC TgTT . GCGCC 

201 CTTTGCCGAT AAcGTGTACC CACGCyTCGT CCAAATAGAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGGGCG TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

3 51 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGTGCATCAG 
4 51 GCCGCGCTTT AC C AG C C AAA CGCAATACTG CCGCCAAGAC GAAAGCTTGC 
501 GAGCCAGCGT CCGTTCCCCC AAACCGCG... 



This corresponds to the amino acid sequence <SEQ ID 112; ORF 032>: 

m032.pep (partial) 

1 MRRNVXAVAV XRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFADQR HLPLXAPFAD NVYPR XVQID IICIQAVYLA HA QTAAVHQF 

101 EQGVVAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGVHQ 

151 AALYQPNAIL PPRRKLASQR PFPQTA. . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 

a032 . seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTT TTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGCCGC TGCTTGCGTC 

2 01 CTTTGCCGGT AACGTGTACC CACGCCTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGCGCG TGATCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG TAT G C AG C AG 

4 51 ACCGCGTTTG AC C AG C C AG G CGCAATACTG CCGCCAAGAC GACAGCTTGC 

501 GCGCCAGCGT CCGCGCATTC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GCCGTAAAAT CGCGCTGCGA CAAGCCCTGC GGCACGCCGC CTGCATCTTC 

601 AGACGGCATT TGTGCCAACA GCGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCCAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

7 01 TAAGCGTCAA AATGCGCCGC AAACCCGTCC AAAACCATAA CCGCCCCACA 

7 51 CAAATATCAA AAAAACAGTG A 

This corresponds to the amino acid sequence <SEQ ID 114; ORF 032.a>: 

a032.pep 

1 MRRNVPAVAV LRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NL PL LAS FAG NVYPR LVQIY IICIQAVYLA HA QTAAVHQF 

101 EQRVIAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGMQQ 

151 TAFDQPGAIL PPRRQLARQR PRIQTALRQP PQRRRKIALR QALRHAACIF 

201 RRHLCQQRKQ FFQIAPVCRH RVLRLALAHD VFQISVKMRR KPVQNHNRPT 

251 QISKKQ* 



m032/a032 88.1 % identity over a 176 aa overlap 

10 20 30 40 50 60 

m032 . pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQI VFQGFHAFADQR 

I I I I I Ml I I I I I I I | f | | | M | | | | | | | j | | | | | | | | | i | | | | | | | , 

a032 MRRNVPAVAVLRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m032 . pep HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGVVAHRQRVAAVHGQIQH 
: I I I t I I I II 1 I I I I I I I I I I I I I I I II II I I I I I II I : I I I I I I I I I I I I I M 
a032 NLPLLASFAGNVYPRLVQIYIICIQAVYLAHAQTAAVHQFEQRVIAHRQRVAAVHGQIQH 

70 80 90 100 110 120 

130 140 150 160 170 

m032 . pep PVQPFLRQGFGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA 
I I M I II II M I I I M I I I I II I I II I : : I : I : I I : I II I I I ! : I I III Ml 
a032 PVQPFLRQG FGYALGLLRRFDVGGRVGMQQTAFDQPGAILPPRRQLARQRPRIQTALRQP 

130 140 150 160 170 180 

a032 PQRRRKI ALRQALRHAACI FRRHLCQQRKQFFQIAPVCRHRVLRLALAHDVFQI SVKMRR 

190 200 210 220 230 240 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 032 shows 86.4% identity over a 176 aa overlap with a predicted ORF (ORF 032.ng) 
from N. gonorrhoeae: 

m032/g032 

10 20 30 40 50 60 

MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFADQR 
I I I I I I I I I III : : I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I II 
MRRNVPAVAVLRRPRFEAFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR 
10 20 30 40 50 60 

70 80 90 100 110 120 

HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGVVAHRQRVAAVHGQIQH 
: I I I I I I I I I II III I I I I I I I I I I I I I I I I II I : I I I I I I I I I I I I I I I I I I I 
NLTLLAPFAGNVYPRFVQIYIICIQAVYLAHAQTAAVHQLEQRVVAHRQRVAAVHGQIQH 
70 80 90 100 110 120 

130 140 150 160 170 

PVQPFLRQG FGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA 
I I I M M I I I I I I I I I I I I I I I I I I I I : II I : II : I M I I I I : I I III III 
PVQPFLRQG FGYALGLLRRFDVGGRVGAHQPAFDQPGAILPPRRQLARQRPTVQTALRQP 
130 140 150 160 170 180 

PQRRRKI APRQVLRHAACI FRRHLCQQCKQFFQIAPVCRNRVLRLALAHDVFQI SVKIRR 
190 200 210 220 230 240 



m032 . pep 
g032 

m032 -pep 
g032 

m032 . pep 
g032 

g032 



The following partial DNA sequence 



g033 . seq 






1 


ATGGCGGCGG 


CGGACAAACT 


51 


CATCGGAGAC 


GGCGCGATGA 


101 


GCGCGGGCGA 


TATGGATGTG 


151 


ATGTCGATTT 


CCCCCAACGT 


201 


CGTCGTGCGC 


GATATGCACG 


251 


GCAAGGTATT 


AGACAAAATA 


301 


GAACAcaaaA 


TCAAAACCCT 


351 


GCTGTCGCTG 


TTTGAAAATT 


401 


GACACAACGT 


CGAGAATCTG 


451 


AAAGGCCCTC 


AGTTGCTGCA 


501 


ACTCGCCGAA 


AACGACCCcg 


551 


AAGAAGGCGG 


GGCGCAAATg 


601 


aaaccgACCT 


ATACCCAAGT 


651 


GGCAGATTCC 


CGACTGGTTG 



identified in N. gonorrhoeae <SEQ ID 1 15>: 

CTTGGGCGGC GACCGCCGCA GCGTCGCCAT 
CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 
GATTTGCTGG TCGTCCTCAA CGACAACGAA 
CGGCGCGTTG CCCAAATATC TTG C CAG CAA 
GACTGTTGAG TACCGTCAAA GCGCAAAcgg 
CCCGGCGCGA TGGagtTTGC CCAAAAAGTC 
TGCCGAAGAA GCCGAACACG CCAAACAGTC 
TCGGCTTCCG CTACACCGGC CCCGTGGACG 
GTGGACGTAT TGAAAGACTT GCGCAGCCGC 
CGTCATCACC AAAAAGGGCA ACGGCTACAA 
t c a AATACCA CGCCGTCGCc aACCTGCcta 
ccGTCTGAAA AAGAACC CAA GCCCGCCgCc 
ATTCGGCAAA TGGCTGTGCG ACCGGGCGGC 
CGATTACCCC CGCCATGCGC GAGGGCAGCG 
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701 GACTGGTGGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

751 ATCGCCGAGC AGCACGCCGT tacCTTTGCC GGCGGTTTGG CGTGCGAAGG 

8 01 CATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGTG CGGGCATCGT CGGCGCGGAC GGTCCGACCC ATGCCGGCTT 

951 GTACGATTTG AGCTTCTTGC GCTGTGTGCC GAACATGATT GTTGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCG 

1051 GATGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCC TTcatTGCCT TCGGCAGTAT GGTCGCCACC 

12 01 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTt 
1251 cgtcaaacCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCAcg 

13 01 accGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCGGC 

13 51 GCGGTCTTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGAACA CGGCGATCCG AAAAAACTTT 

14 51 TGGACGATTT GGGTTTGAGT GCCGAAGCGG TGGAACGCCG GGTGCGCGAG 

15 01 TGGCTGCCGG ACCGTGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 116; ORF 033.ng>: 

g033 .pep 

1 MAAADKLLGG DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAME FAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE ND P VKYHAVA NLPKEGGAQM PSEKEPKPAA 

2 01 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

2 51 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHD I ALQNLPVLFA 
301 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

3 51 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG I IRREGE KTA FIAFGSMVAT 
401 ALAVA EKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE EKTAEQGGAGG 

4 51 AVLEVLAKHG I CKPVLLLGV ADTVTEHGDP KKLLDDLGLS AEAVERRVRE 
501 WLPDRDAAN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

m033 . seq 

1 ATGGCGGCGG CAGACAAACT CTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CAT CGGCGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCaG . CGA TATGGATGTr GATTTGCTrG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGCTG CCGAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GCCTGTTGAG TACCGTCAAA GCGCAAACGG 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

3 01 GAACACAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

3 51 GCTGTCTTTG TTTGAAAACT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 
451 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

5 01 ACTCGCCGAA AACGACCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 
551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 
601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

6 51 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 
701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 
751 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 
8 01 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 
8 51 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTACCCGT TTTGTTTGCC 
901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTCT 
951 GTACGATTTG AGCTTTTTGC GCTGCGTGCC GAACATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGTTGC TTTCGACCTG CTAT CAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCC 

12 01 GCATTGGCGG TTG C CGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

12 51 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 
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13 01 ACCGCATCGT TACCCTTGAA 

13 51 GCGGTGCTGG AAGTATTGGC 

14 01 TTTGGGCGTT GCCGATACCG 

14 51 TAGACGATTT GGGCTTGAGT 

15 01 TGGCTGTCGG ATCGGGATGC 



GAAAACGCCG AACAGGGCGG CGCAGGCGGC 

GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

TAACCGGACA CGGCGATCCG AAAAAACTTT 

GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 
GGCAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 118; ORF 033>: 

m033 .pep 

1 MAAADKXiLGS DRRSVAIIGD gamtagqafe alncaxdmdv dllwlndne 

51 msispnvgal pkylasnwr dmhgllstvk aqtgkvldki pgamefaqkv 

101 EHKIKTLAEE aehakqslsl fenfgfrytg pvdghnvenl vdvlkdlrsr 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

301 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG IIRREGE KTA FIAFGSMVAP 

4 01 ALAVAEKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

451 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AE AVE RRVRA 

501 WLSDRDAAN* 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 1 19>: 

a033 . seq 

1 ATGGCGGCGG CGGACAAACA GTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGTCA GGCGTTTGAA GCCTTGAACT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGTGCGTTG CCCAAATACC TTGCCAGCAA 

2 01 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAACGG 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

301 GAACATAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

351 ACTGTCTTTG TTTGAAAACT TCGGCTTCCG CTATACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTCGATGTAT TGGAAGACCT GCGCGGACGC 

4 51 AAAGGCCCGC AGCTTCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

501 ACTCGCCGAA AACGATCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 

551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA. GCCCGCCGCC 

601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 

701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

7 51 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 

801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTTT 

951 GTACGATTTA AGCTTTTTGC GCTGCATTCC GAATATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAATGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGTGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCT 

1201 GCATTGGCGG TCGCCGGAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 

1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCAGC 

1351 GCGGTGCTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTCTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 

14 51 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 

1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 120; ORF 033.a>: 

a033 . pep 

1 MAAADKQLGS DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLEDLRGR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 
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301 VDRAGIVGAD GPTHAGLYDL SFLRCIPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGVPVSD GMETVEIGKG IIRREGEK TA FIAFGSMVAP 

4 01 ALAVAGKLNA TVADMRFVKP I DEELIVRLA RSHDRIVTLE ENAEQGGAGS 

4 51 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA 

501 WLSDRDAAN* 

m033/a033 98.4% identity over a 509 aa overlap 

10 20 30 40 50 60 

m033.pep MAAADKLLGSDRRSVAIIGDGAMTAGQAFEALNCAXDMDVDLLVVLN DNEMSISPNVGAL 

I I M I I I I I M I I I I I I I I I I M It I I I I I I I I I I I I I I I I I I M I I I M I I I M I I I 
a 0 3 3 MAAADKQLGS DRRSVAI I GDGAMTAGQAFEALNCAGDMDVDLLVVLN DNEMSIS PNVGAL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m033 . pep PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 
I I I I I I I M I I I I I M I I I I I I t I M I I It ! I I I II I t M I I I It I M I M I M ! I M I I 
a033 PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m033 . pep FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 
I I M M I I I I t II I I I I I I I t I t I : f M : I I I t I I I I I t I I t I I t I | | I | | | I I I I I I I I 
a033 FENFGFRYTGPVDGHNVENLVDVLEDLRGRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m033 . pep NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 
t I I I I i I t I I I I I I I I I M I I I I I I I I I II II I I t I I I I I I I | I | | t I I II II I I I M I I 
a033 NLPKESAAQMPSEKE PKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

m033 . pep RFPDR Y FDVGI AEQHAVT FAGGLACEGMKPVVAI YS T FLQRAYDQLVHD I ALQNL PVL FA 

I I I I I I I I I M I I I II f I I I t 1 I I I I I I I I I I I I I I I I I M I I I I I I I I | | | M I I I I I I 
a033 R FP DRY FDVGI AEQHAVT FAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNL PVL FA 

250 260 270 280 290 300 

310 320 330 340 350 360 

m033 . pep VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 
I I I I I I I M I I I I I I II I I I I I I I I : 1 I I I I I I M I II I I I ! I I I M II I I I I I I t M I I 
a033 VDRAGIVGADGPTHAGLYDLSFLRCIPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

310 320 330 340 350 360 

370 380 390 400 410 420 

m033 . pep GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 
I I I I I : I I f It I I I I I I I M I I I I I I I I I I I I I I II I | | | | | | | | I I I I I I I I I I I I I I 
a033 GTGTGVPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAGKLNATVADMRFVKP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m033 . pep IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 
I E I I I I I I I t I I! I I I I I I I I I I I II I I f : I I I I I I I I I E I t I I I I I I I I I I II I I I I I I 
a033 IDEELIVRLARSHDRIVTLEENAEQGGAGSAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

430 440 450 460 470 480 

490 500 510 

m033 .pep KKLLDDLGLS AEAVERRVRAWLSDRDAANX 

I I I I I I I I I I I I I I I 1 I I I I I I M I I I I I I 
a033 KKLLDDLGLS AEAVERRVRAWLSDRDAANX 

490 500 510 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 033 shows 98.4% identity over a 509 aa overlap with a predicted ORF (ORF 033.ng) 
from N. gonorrhoeae: 



m033/g033 






m033 


.pep 


m/\/\/ujisJ_iijtjt)JJKKS3 VAl l ojjuAM J, J\\j(jj\r hiALNCAADr^VJJLLVVLNDNEMS I SPNVGAL 

1 IM 1 1 1 IM! Ill lUMIMIIMM 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 M II 1 

MAAADKLLGGDRRS VAI I GDGAMTAGQAFEALNCAGDMDVDLLWLNDNEMS I S PNVGAL 


60 


g033 




60 


m033 


.pep 


FKYLiAi>N V VKUMHOijljb I V KAyTCjKA/XiDKI PGAMEFAQKVEHKI KTLAEEAEHAKQSLSL 


120 


g033 




IIMIIMIIIIIIIIIIIMIIIMIMIIIIIIMIMMIIIillMMIIIIIIII 

PKYLASNWRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKI KTLAEEAEHAKQSLSL 


120 


m033 


.pep 


FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

IIIIIIIIIIIIIIMIIIIIIMIIMIIIIilllMIIIIIMIIIIMIIMIMM 

FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 


180 


g033 




180 


m033 


.pep 


NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

M 1 M ^ 1 1 ! M M 1 1 1 11 M 1 II 1 1 M 1 M 1 j 1 M i I M 1 1 1 1 1 1 1 M 1 M 1 ! 1 1 i |{ 1 

NLPKEGGAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 


240 


g033 




240 


m033 


*pep 


RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 

Ml III II 1 II IM III MMMM Mill IMMIIMM IMIMM I MIMIIMI 

RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 


300 


g033 




300 


m033 . 


.pep 


VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

1 M IN 1 M 1 1 1 1 1 1 1 1 1 M M,il MMM Mill II M 1 m M 1 1 M M 1 M M 1 II 1 

VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 


360 


g033 




360 


m03 3 . 


.pep 


GTGTGAPVSDGMEWEIGKGIIRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 

1 M Ml 1 iMM 1 1 M MIM MMMM Mi MIIIMI 1 II 1 1 1 1 M 1 1 1 1 M 1 1 M 1 

GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVATALAVAEKLNATVADMRFVKP 


420 


g033 




420 


m033 . 
g033 


.pep 


IDEELIVRIiARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

IM II 1 MIM MM IM MMIIIMIII MMMMMMIMMMM MM 1 MM 

IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTEHGDP 


480 
480 


m033 . 


pep 


KKLLDDLGLSAEAVERRVRAWLSDRDAANX 510 

i 1 1 II MM M IM 1 II 1 1 II MIMM 

KKLLDDLGLSAEAVERRVREWLPDRDAANX 510 




g033 







The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 121>: 

g034.seq 

1 ATGAGCCGTT TATGGTTTTT 

51 TTACCTATTG CCCAAGGAGA 

101 TGCTTGACCA CGCCGCCGAA 

151 AACAACCTCG AACAAATGCG 

201 CGCGCCCGTC ATCGTACAGG 

251 CGCCGTTTTT GCGCCACCTG 

3 01 ATCCCCGTCG TGATGCACCA 

3 51 ACGCTCCATC CAACTGGGCT 

4 01 TCGAAGACGG CAAAACCCCT 
4 51 CGTACCGTCG TCAACTTCTC 
501 AATCGGCGTA TTGGGCAACC 
551 GAGTGGGCGC GGCAGGCAAA 
6 01 GAAGATGCCG TGCGTTTCGT 
651 TGCCGTCGGC ACCAGCCACG 



TGCCGTAAAA 
CACAAATGGC 
AACAGCTACG 
CGCCATTATG 
CG AG CGCAGG 
ATTCTGGCGG 
AGACCACGGC 
TCTCCTCCGT 
TCTTCTTACG 
CCACGCCTGC 
TCGAAACCGG 
CTCTCACACG 
T AAAG AT AC C 
GCGCATACAA 



AACATTATAA 
ACTCGTATCC 
GCCTGCCCGC 
GAAGCCGCCG 
TGCGCGCAAA 
CAGTCGAAGA 
GCATCGCCCG 
GAT G ATGGAC 
AATACAACGT 
GGCGTGTCCG 
CGAAGCAGGC 
ACCAAATGCT 
GGCGTTGACG 
ATTCACCCGT 



TCCGCCTTAT 
ATGCGCCAAC 
GTTCAACGTC 
ACCAAGTCAA 
TACGcggGCG 
ATTTCCGCAC 
ACGTgtgCCA 
GG CTCTTTGC 
CAACGCCACC 
TCGAAGGCGA 
GAAGAAGACG 
CACCAGCGTT 
CATTGGCGAT 
CCGCCCACAG 
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701 GCGACGTATT GCGTATCGAC 

751 AATACACACA TCGTGATGCA 

8 01 GAAAGTCATC AACGAATACG 

851 CGGTTGAAGA AATCGTCGAA 

901 ATCGATACCG ACCTGCGCCT 

951 TGCCGAAAAC CCGTCCGACT 

1001 TTGAAGCGAT GAAGCAAATC 

1051 GAAGGTCAGG CAGG CAAAAT 

1101 CCGTTATGCC AAGGGCGAAT 



CGCATCAAGG AAATCCACCA AGCCCTGCCC 
CGgctCCAGC TCCGTTCCGC AAGAatgGCT 
GCGGCAATAT CGGCGAAACC TACGGCGTGC 
GGCATCAAAC ACGGCGTGCG CAAAGTCAAC 
CGCTTCCACC GGCGCGGTAC GCCGCTACCT 
TTGATCCGCG CAAATACTTG GGCAAAACCA 
TGCCTCGACC GTTATCTTGC GTTCGGTTGC 
CAAACCTGTT TCG TTGG AAA AAATGGCAAG 
TGAACCAAAT CGTCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 122; ORF 034.ng>: 

g034 .pep 

1 MSRLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHA AE NSYGLPAFNV 

51 NNLEQMRAIM EAADQVNAPV IVQASAGARK YAGAPFLRHL ILAAVEEFPH 

101 I PWMHQDHG ASPDVCQRSI QLGFSSVMMD GSLLEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAAGK LSHDQMLTSV 

201 EDAVRFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTH I VMHGS S SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL GKTIEAMKQI CLDRYLAFGC 

351 EGQAGKI KPV SLEKMASRYA KGELNQIVK* 



The following partial DNA sequence 



m034 . seq 


(partial) 




1 


ATGAGCTGTT 


TATGGTTTTT 


51 


TTACCTATTG 


CCCAAGGAG A 


101 


TGCTTGATCA 


TGCTGCCGAA 


151 


AACAACCTCG 


wACAGATGCG 


201 


CGCCCCCGTC 


ATCGTACAGG 


251 


CGCCGTTTTT 


ACGCCACCTG 


301 


ATCCCCGTCG 


TCATGCACCA 


351 


ACGCTCCATC 


CAACTGGGCT 


401 


TGGAAGACGG 


CAAAACCCCT 


451 


CGTACCGTGG 


TTAACTTCTC 


501 


AATCGGCGTA 


TTGGGCAACC 


551 


GTGTAGGCGC 


AGTGGGCAAA 


601 


GAAGATGCCG 


TATGTTTCGT 


651 


TGCCGTCGGC 


ACCAGCCACG 


701 


GCGATGTATT 


ACGTAT CGAC 


751 


AATACACACA 


TCGTGATGCA 



; identified in N. meningitidis <SEQ ID 123>: 

TGCTGTAAAA AACATTATAA TCCGC CTTAT 
CACAAATGGC ACTCGTATCC ATGCGCCAAC 
wACAGCTACG GCyTGCCGGC GTTCAACGTC 
CGCCATCATG GAGGCTGCAG AC CAAGTCGA 
CGAGTGCCGG TGCGCGCAAA TATGCGGGTG 
ATTTTGGCGG CTGTCGAAGT ATTTCCACAC 
AGACCACGGC GCATCACCCG ACGTGTGCCA 
TCTCCTCTGT AATGATGGAC GGCTCGCTGA 
TCTTCTTACG AATACAACGT CAACGCCACA 
CCACGCTTGC GGCGTATCCG TTGAAGGCGA 
TCGAAACCGG CGATGCAGGC GAAGAAGACG 
CTTTCCCACG AC CAAATG CT GACCAGCGTC 
TAAAGATACC GGCGTTGACG CATTGGCTAT 
GCGCATACAA ATTCACCCGT CCGCCCACAG 
CGC AT CAAAG AAATCCACCA AGCCCTGCCC 
C. . . 



This corresponds to the amino acid sequence <SEQ ID 124; ORF 034>: 

m034.pep (partial) 

1 MSCLWFFAVK Nil IRLIYLL PKETQMALVS MRQLLDHA AE XSYGLPAFNV 

51 NNLXQMRAIM EAADQVDAPV IVQASAGARK YAGAPFLRHL I LAAVEVFPH 

101 I PWMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGDAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVCFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

2 51 NTHIVMH. . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 125>: 

a034 . seq 

1 ATGAGCCGTT TATGGTTTTT TGCCGCAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGCGGGCG 

2 51 CGCCGTTTTT GCGCCACCTG ATTTTGGCGG CTGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTGTGCCA 

351 ACGCTCCATC CAACTGGGCT TTTCCTCCGT GATGATGGAC GGCTCGCTGA 

4 01 TGGAAGACGG CAAAACCCCT TCTTCTTATG AATACAACGT CAACGCCACC 
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4 51 CGTACCGTGG TTAATTTCTC CCACGCCTGC GGCGTATCCG TTGAAGGCGA 

501 AATCGGCGTA TTGGGCAACC TCGAAACTGG CGAAGCCGGC GAAGAAGACG 

551 GTGTAGGCGC AGTGGGCAAA CTTTCCCACG ACCAAATGCT CACCAGCGTC 

601 GAAGATGCCG TGCGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCGAT 

651 TGCCGTCGGC ACCAGCCACG GCGCGTACAA ATTCACCCGT CCGCCCACAG 

7 01 GCGACGTGTT GCGTATCGAC CGCATCAAAG AAATCCACCA AGCCCTGCCC 

7 51 AATACACACA TCGTGATGCA CGGCTCCAGC TCCGTTCCGC AAGAATGGCT 

801 GAAAGTCATC AACGAATACG GCGGCAATAT CGGCGAAACC TACGGCGTGC 

851 CGGTTGAAGA AATCGTCGAA GGCATCAAAC ACGGCGTGCG TAAAGTCAAC 

901 ATCGATACCG ACTTGCGCCT TGCTTCCACC GGCGCGGTAC GCCGCTACCT 

951 TGCCGAAAAC CCGTCCGACT TCGATCCGCG CAAATATTTG AGCAAAACCA 

1001 TTGAAGCGAT GAAGCAAATC TGCCTCGACC GCTACCTCGC GTTCGGTTGC 

1051 GAAGGTCAGG CAGGCAAAAT CAAACCGGTT TCCTTGGAAA AAATGGCAAA 

1101 CCGTTATGCC AAGGGCGAAT TGAACCAAAT CGTCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 126; ORF 034.a>: 

a034 .pep 

1 MSRLWFFAAK NIIIRLIYLL PKETQMALVS MRQLLDHAA E NSYGLPAFNV 

51 NNLEQMRAIM EAADQVNAPV IVQASAGARK YAGAPFLRHL ILAAVEEFPH 

101 IPVVMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTVVNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVRFVKDT GV DALAI AVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL SKTIEAMKQI CLDRYLAFGC 

351 EGQAGKIKPV SLEKMANRYA KGELNQIVK* 

m034/a034 96.9% identity over a 257 aa overlap 

10 20 ' 30 40 50 60 

m03 4 .pep MSCLWFFAVKNIIIRLIYLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 
M I I II I : I I I I I I I I M I I I M M I I I I I I I I I I I I I I MINIMUM I M M I 
a034 MSRLWFFAAKNIIIRLIYLLPKETQMALVSMRQLLDHAAENSYGLPAFNVNNLEQMRAIM 

10 20 30 40 50 60 

70 80 90 100 110 120 

m034 . pep EAADQVDAPVIVQASAGARKYAGAPFLRHLILAAVEVFPHIPVVMHQDHGASPDVCQRSI 
I I I I M : I I I ! I I I I I I I I I II I I I I I I I II ! I I M II I I j I I I I I I I I | M II I I I M 
a034 EAADQVNAPV IVQASAGARKYAGAPFLRHL ILAAVEEFPH I PVVMHQDHGASPDVCQRS I 

70 80 90 100 110 120 

130 140 150 160 170 180 

m034 .pep QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGDAG 
I M I I I I I i I I I I I I I I M I II I i I M I I I I I I I I I I I I I I I I I M I I I I I I M I I I : I f 
a034 QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGEAG 

130 140 150 160 170 180 

190 200 210 220 230 240 

IU03 4 . pep EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 
M I I M I I I I I I M I I I M I I I I I I I I I I I [ II j I [ I I I I I I I | | | || | | | | | | | | | U 
a034 EEDGVGAVGKLSHDQMLTSVE DAVR FVK DTGV DALA I AVGTSHGAYK FT R PPTGDVLRID 

190 200 210 220 230 240 

250 

m03 4 .pep RIKEIHQALPNTHIVMH 
I I I I M II I I I I I I I I I 

a034 RIKEIHQALPNTHI VMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 

250 260 270 280 290 300 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 034 shows 96.5% identity over a 257 aa overlap with a predicted ORF (ORF 034.ng) 
from TV. gonorrhoeae: 
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m034/g034 






m034 
g034 


.pep 


MSCLWFFAVKNI I IRLI YLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 
II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 | | | | | | 
MSRLWFFAVKNI I IRLI YLLPKETQMALVSMRQLLDHAAENSYGLPAFNVNNLEQMRAIM 


60 
60 


m034 


.pep 


EAADQVDAPVI VQASAGARKYAGAPFLRHLILAAVEVFPHI PWMHQDHGAS PDVCQRS I 

II 1 1 1 MM MM IM M MIMi 1 II 1 Ml II Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 

E AADQVNAP V I VQ AS AG ARKY AGAP FLRHL I LAAVEE FPHI PWMHQDHGAS PDVCQRS I 


120 


g034 




120 


m034 


.pep 


QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGDAG 
1 1 1 1 1 1 1 II 1 1 1 h 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 : II 
QLGFSSVMMDGSLLEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGEAG 


180 


g034 




180 


m034 , 


.pep 


EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 

IMM MM (M M 1 1 IMMIII M IMMIMMIMM Ml Ml 1 Ml Ml 1 IMM 

EEDG VGAAGKLSHDQMLTS VEDAVRFVKDTGVDALAI AVGTSHG AYKFTRP PTGDVLR I D 


240 


g034 




240 


m034 . 


.pep 


RIKEIHQALPNTHIVMH 

IIIIIIIIIMMIMI 

RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 


257 


g034 




300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 127>: 

g036.seq 

1 ATGCTGAAGC CGTGTTTGGT ATACAGTGCC TGTGCGGCGG cgttgcCTGC 

51 GCGGACTTCG AGCAGCAGGC GTTGCGTGCC TTCGGGCAGA TGTGCGTACC 

101 AATATTCGAG CAGGGCGGAC GCAACGCCCC GTCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCTGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACAAG CGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGTAT CGAGCGCGGC CAGTGCGGCG CAGTCGGACG GTGAGGCTGG 

351 GCGGATGTTC ATGTTCGTGC CTTCCGTTCC GCCTGTTCTT TGGCAGTCAG 

401 GGCGATTTTG TTGCGGACGT AGAGCAGTTC GGCGTGTGCC GCGCCAGTTG 

4 51 CGGGATAGCC GCCGCCGAGG GCGAGCGCGA GAAAATCGGC GGCGGTCGGC 

501 ATATCGGGTT T G C CT GAG AA GGGCGGACGG TTTTCCAGTG CGAACGCACT 

551 GCCGATGCCG TCTGAAAAGA CGTACCCCTC GGGGAGGGCA ATGTCTGCCG 

601 CCCTACCGAC TTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CACGCATAAA ACACTTCGCC CATACGCGCG TCCGCAGCGG CGAGTATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGTG GGGATGCCGA 

751 TTAAAGGCGT GTCGAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 128; ORF 036.ng>: 

g036 .pep 

1 MLKPCLVYSA CAAALPARTS SSRRCVPSGR CAYQYSSRAD ATPRRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA ILPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTVSSAASAA QSDGEAGRMF MFVPSVPPVL WQSGRFCCGR RAVRRVPRQL 

151 RDSRRRGRAR ENRRRSAYRV CLRRADGFPV RTHCRCRLKR RTPRGGQCLP 

201 PYRLDNRSNG GGSACRTTHK TLRPYARPQR RVCSFAAAAA RRRHRAWGCR 

251 LKACRTALPN LAPRRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 129>: 

m036 . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCg . CGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 



WO 99/57280 



PCTAJS99/09346 



220 



3 51 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAAACC GGCGTGTGCG GCATGGACGG 
4 51 CAGGATAACC GCCCTTGGCT GCCAATGCGA GAAAGTCGGC GGCAGTCGGC 
501 ATATCCGGTC TGCCTGAGAA CGGCGGAGCT TCTTCCAGCG CGAACGCGCT 
551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 
601 CCCGCCCGAC CTGATAATCG CTCAAACGGT GGCAGTTCAG CGTATCGAAC 
651 CAT G CAT AAA ACACTTCGCC CATACGAGCG TCCGTAGCGG CAAGGATGCA 
701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC AT C GAG C GAG GGTACGCCGA 
7 51 TTAAGGGGGT ATCAAACGGC GTTGCCAAAC CCTGAGCTAC ACCGATGCCG 
801 ATACGCAGTC CGGTAA 



This corresponds to the amino acid sequence <SEQ ID 130; ORF 036>: 

m036 .pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPXSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 

201 PARPDNRSNG GSSAYRTMHK TLRPYERP*R QGCSFAAAAA RRRHRARVRR 

251 LRGYQTALPN PELHRCRYAV R* 



The following partial DNA sequence was 

a036. seq 

1 ATGCTGAAGC CGTGCGCCGT 

51 ACGGACTTCG AGCAGCAGGC 

101 AAT AT T CG AG CAGGGCGGAC 

151 GTGGCAATCA GGTGCAGTTC 

201 AAAGGCGGCA ATCCCGCCGT 

251 GCGAAACCAG TGCGGACTCA 

301 C AG ACG G CAT CGAGCGCGGC 

351 GCGGATGTTC ACAGGCGCGC 

4 01 GGCGATTTTG TTGCGGACGT 

4 51 CGGGAAAACC GCCTTCAGCC 

501 ATATCGGGTT TGCCTGAGAA 

551 GCCGATGCCG TCTGAAAAGG 

601 CCCGACCGAC CTGATAATCG 

651 CATGCATAAA ACACTTCGCC 

7 01 GCTTTGCGGC GGCGGCAGCG 

7 51 TTAAAGGAGT ATCAAACGGC 

801 ATACGCAGTC CCGTAA 



identified in TV. meningitidis <SEQ ID 13 1>: 



GTACAGTGCC 
GTTGCGTGTC 
GCAATTCCTT 
GGATTCGTCG 
CTTTTTCCGC 
AATTGGCGTT 
GAGTGCGGCG 
CCTCCGTTCC 
AGAGCAGCTC 
G C C AG AT T G A 
GGGCGGACGG 
CGCATCCTTC 
CTCAAACGGC 
CATACGTGCG 
AGGCGGCGGC 
GTTGCCAAAC 



TGTGCGGCGG 
TTCGGGCAGA 
GGCGGCGGCA 
GGCAGGTTCT 
AAGGAAAACC 
GCGTCCACGC 
CAATCGGCAT 
GCCTGTTCTT 
GGCGTGTGCC 
GGAAGTCGGC 
TTTTCCAGCG 
CGGCAGCCGG 
GGCGGTTCAG 
TCCGCAGCGG 
ATCGAGCGAG 
CTTGCGCCAC 



TGTTGCCTGC 
TGTGTGAACC 
TTCGGGCGCG 
GCCAAACGAT 
TGTTCGGACG 
GGACGGGTTG 
AAACGGCGCG 
TGGCAGTCAA 
GCAGCGACGG 
GGCGGTCGGC 
CGAACGCATT 
ATGTCTGCCG 
CGTGTCGAAC 
CAAGGATGCA 
GGTACGCCGA 
GCCGATGCCG 



This corresponds to the amino acid sequence <SEQ ID 132; ORF 036.a>: 

a03 6.pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASAA QSA*TARRMF TGAPSVPPVL WQSRRFCCGR RAARRV PQRR 

151 RENRLQPPD* GSRRRSAYRV CLRRADGFPA RTHCRCRLKR RILPAAGCLP 

201 PDRPDNRSNG GGSACRTMHK TLRPYVRPQR QGCSFAAAAA RRRHRARVRR 

2 51 LKEYQTALPN LAPRRCRYAV P* 



m036/a036 85.6% identity over a 270 aa overlap 

10 20 30 40 50 60 

m03 6.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
M I 1 I I It I I I I I 1 I I 1 I I I I I t 1 I i I ! I I I I I I I I I t I It M I I I I I I M I I I I I I I I I 

a03 6 MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGA VAIRCSSDSS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m036.pep GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
I 1 I I II I I ! I I I I I I I It I I I II I I I I I I I 1 I I I I I I I I I I II I I M : : I II I I I I I I 
a03 6 GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASAAQSAXTARRMF 

70 80 90 100 110 120 
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130 140 150 160 170 180 

m03 6. pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
Ml II I I I I I I I I I I I I I I III : I : : I I I I I I : I I I II I I I : : II 
a03 6 TGAPSVPPVLWQSRRFCCGRRAARRVPQRRRENRLQPPDXGSRRRSAYRVCLRRADGFPA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m036.pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 
I I : I I I M I I I MINIM I I I I I I I 11 : II M I II I I I I I II I I I I I I I II I I 
a03 6 RTHCRCRLKRRILPAAGCLPPDRPDNRSNGGGSACRTMHKTLRPYVRPQRQGCSFAAAAA 

190 200 210 220 230 240 

250 260 270 

m036 . pep RRRHRARVRRLRGYQTALPNPELHRCRYAVRX 

I It I II I M I I : I I I I I M : I I I I I I 
a03 6 RRRHRARVRRLKEYQTALPNLAPRRCRYAVPX 

250 260 270 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 036 shows 74.9% identity over a 271 aa overlap with a predicted ORF (ORF 036.ng) 
from N. gonorrhoeae: 

m036/g036 

10 20 30 40 50 60 

m036 . pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
I II I I II I I I I I : I I I I I M I I M I MM: I I I M I II I I I I I I II I I I I M M I 
g 036 MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m03 6.pep GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 

I I I I M II M I I I I I I I M II I M M I II I I I I I I I I I M : I I I M :: I I I III 
g036 GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m03 6 . pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: II I I I I I I I I I I i I I I I Ml : MM I I M I M I I MM M : I : 

g03 6 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 

130 140 150 160 170 180 

190 200 210 220 230 240 

m036 . pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 
MM I I II II I : : Mill I I j I I II M I El M II I II II 1 : I I I I M I I 
g036 RTHCRCRLKRRT PRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 

190 200 210 220 230 240 

250 260 270 

mO 3 6 . pep RRRHRARVRRLRGYQTALPNPELHRCRYAVRX 

II M I I IM: : I II II M II II I I I 
g03 6 RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 

250 260 270 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 133>: 

m036-l . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
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351 
401 
451 
501 
551 
601 
651 



GCGGATGTTC 
GGCGATTTTG 
CAGGATAACC 
ATATCCGGTC 
GCCTATGCCG 
CCCGCCCGAC 
CATGCATAAA 



ACGGGCGCGC 
TTGCGGACGT 
GCCCTTGGCT 
TGCCTGAGAA 
TCTGAAAAGG 
CTGATAATCG 
ACACTTCGCC 



TCTCCGTTCG 
AGAGCAAACC 
GCCAATGCGA 
CGGCGGAGCT 
CGCATCCCTC 
CTCAAACGGT 
CATACGAGCG 



GCCTGTTCTT 
GGCGTGTGCG 
GAAAGTCGGC 
TCTTCCAGCG 
CGGCAGCCGG 
GGCAGTTCAG 
TCCGTAG 



TGGCAGTCAG 
GCATGGACGG 
GGCAGTCGGC 
CGAACGCGCT 
ATGTCTGCCG 
CGTATCGAAC 



This corresponds to the amino acid sequence <SEQ ID 134; ORF 0036- 1>: 

m03 6- 1 . pep 

1 MLKPCAVYSA CAAVL PART S SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 
201 PARPDNRSNG GSSAYRTMHK TLRPYERP* 



m036-i/g036 76.8% identity in 228 aa overlap 



10 20 30 40 50 60 

m03 6-l .pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
INN I I I I! I I : M M I I M f I I I MM: [ M M M I I I M M M M M I M M 
g03 6 MLKPCLVYSACAAALPARTS S SRRCVPSGRCAYQYS SRADATPRRRHSGAVAIRCS S DS S 

10 20 30 40 50 60 

70 80 90 100 110 120 

m03 6-l .pep GRFCQT I KAAIPPSFSARKTCSDGET SAD SNWRCVHADGLQTASSAASSS QSAQTARRMF 
I M M M I M I M M M M I M M M M M M I M M M M M I M I : M I I Ml 
g03 6 GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m03 6-l . pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: M M I M M M M M M M i : MM M M M M I MM I : : I : 

g036 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 

130 140 150 160 170 180 

190 200 210 220 229 

m036-l .pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPX 

MM M M I I E : : MM I M M M M M M M M M I II 
g036 RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 

190 200 210 220 230 240 

g03 6 RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 135>: 

g038 . seq 

1 ATGACTGATT TCCGCCAAGA TTT CCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTC AACGACGGCG CGTCCACGCT GCAACTGGCA 

151 AAATTCTATG CACAAT C CAT CATTGAAAGC GGCATC C GAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GC CAAAGAC C GCGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCAATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

451 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTAAATTGT CCGCCGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATCCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GCACCTACCG C C GGCAGT AC GGCGTAGAAT AA 
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This corresponds to the amino acid sequence <SEQ ID 136; ORF 038.ng>: 

g038 .pep 



1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGASTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDRGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GKLSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRTYRRQY GVE* 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 137>: 



1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAGGC AGGACGGC GG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATT TTACG CACAATCCAT CAT T GAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGTCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC AC GGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG AC GTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGAT CAA ACTGATTGAA GCGGAGGGTG CAACCCCcGC CGGTGTCGCC 

451 ATCGCGCTCG ATCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAr AAACAATACG GkCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCC TACCG TCGGCAGTAC GGCGTAGAAT AA 



This corresponds to the amino acid sequence <SEQ ID 138; ORF 038>: 

m038.pep 



1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRAYRRQY GVE* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 139>: 

a038 . seq 



1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTCA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CAT T GAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

4 01 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

4 51 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 



This corresponds to the amino acid sequence <SEQ ID 140; ORF 038.a>: 

a038 .pep 



1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

2 01 EPVRAYRRQY GVE* 



m038 . seq 



m038/a038 



100.0% identity over a 213 aa overlap 



m038 



pep 



10 20 30 40 50 60 

MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
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I I I I I I 1 I M I I I I M M ! II 1 It I I I M I I I I I ! I I I II I I I I i I I I 1 I I II ! I I M ! I 
a038 MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 

10 20 30 40 50 60 

70 80 90 100 110 120 

m038 . pep GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 
I i I I I I I I I I I I I I I I II it I I M 11 I I 11 M II I ! 1 I I I I I I I I I I I I I II I I I I I I I I 
a038 GIRFDMLFG PAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m038 . pep I I DDVI SAGTSVRES IKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

I I ! M I I I I M I II I I I I I I I I I I I II I I I I I I I I M I I I I I II I I I I I I I I I I I I I I I I 
a 038 I I DDVI SAGTSVRES IKLIEAE GAT PAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

130 140- 150 160 170 180 

190 200 210 

m038 . pep ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX 
I I I I I I I I I I I I II II M I I I I I M I I I I II I I I 
a 0 3 8 AS LNDLFILLQNN PE FGQFLE PVRAYRRQYGVEX 

190 200 210 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 038 shows 98.1% identity over a 213 aa overlap with a predicted ORF (ORF 038.ng) 
from N. gonorrhoeae: 

m038/g038 

10 20 30 40 50 60 

m038 . pep MT D FRQ D FLK FS L AQN V LK FGE FT T KAGRR S P Y F FN AG L FN DG L S T LQ LAK F Y AQ SUES 

I (I I I I I 1 I I I I I I II I I I II I I I 1) I I I 1 I I 11 I II I 1 I II I I I I I I I 11 I I I I I 1 II 
g038 MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGASTLQLAKFYAQSIIES 

10 20 30 40 50 60 

70 80 90 100 110 120 

m03 8.pep GIRFDMLFG PA YKG I I LAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVL VGA PLKGRVL 

I I I M I I I I I I I I I II I I I I I II I I I I I I I I I I I II M I I I I I : I II I I I M I I I I I I I I 
g038 GIRFDMLFG PA YKG 1 1 L AAAT AMM LAE KG VN V P FA YN RKE AK D R GE GG V L V G A P L KGR V L 

70 80 90 100 110 120 

130 140 150 160 170 180 

m038.pep I I DDVI SAGTSVRES IKL I EAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

I I 1 I I I I I M I I I I I I I I I II I 1 II I I I M I M I I I I I I I I : I 1 I I I II I I I I I I I I I I I 
g038 I I DDVI SAGTSVRES I KLIEAEGATPAGVAIALDRMEKGTGKLSAVQEVEKQYGLPVAPI 

130 140 150 160 170 180 

190 200 210 

mO 38 . pep AS LNDLFILLQNNPE FGQFLE PVRAYRRQYGVEX 

I I II I I I I I I I I I I I I I II M I I ! : I I I I I I I I I 
g038 ASLNDLFILLQNNPEFGQFLEPVRTYRRQYGVEX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 141 >: 

g039 . seg 

1 ATGCCGTCCG AACCACCTGC CGCTTCAGAC GGCAT CAAAC CGACACACAC 

51 CGAGAAAACA TCATGCCCGC CTGTTTCTGT CCGCACTGCA AAACCCGCCT 

101 CTGGGTCAAA GAAAcccagC TCAAcgtCgC ccaagGCTTC GTCGTCTgcc 

151 aaAAAtgcga agGGCTgttt aaAgccaaaG accAtctggc aaGcacGAAA 

201 gaacctatat tcaacgattg gcccgaagct gtttcgggat gTcaaaCTCG 
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251 TCcaccgcaT cggcacgcac gccattagca aGAaacagat gtcccgcgac 

3 01 gaaatCgccg atatcctcaa cggcggtaca acCCTGCACG ATACGCCGCC 

3 51 CGCAACCGCC GCTGCCGCac ctGCCGCCGC ACCGCaggTT TCCGTACCGC 
401 CCGCCCGTCA GGAAGGGCTC AAC TGG ACT A TTGCAACCCT GTTCGCACTT 

4 51 ATCGTCCTCA TTATGCAGCT TTCCTACCTC TTCATCCTAT GA 



This corresponds to the amino acid sequence <SEQ ID 142; ORF 039.ng>: 

g039 .pep 

1 MPSEPPAASD GIKPTHTEKT SCPPVSVRTA KPASGSKKPS STSPKASSSA 

51 KNAKGCLKPK TIWQARKNLY STIGPKLFRD VKLVHRIGTH AISKKQMSRD 

101 EIADILNGGT TLHDTPPATA AAAPAAAPQV SVPPARQEGL NWTIATLFAL 

151 IVLIMQLSYL FIL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 143 



m039 . seq 






1 


ATGCCGTCCG 


AACCGCCTTA 


51 


CGAGGAAATA 


CCATGCCCGC 


101 


CTGGGTCAAA 


GAAACCCAAC 


151 


nnnnnnnnnn 


nnnnnnnnnn 


201 


nnnnnnnnnn 


nnnnnnnnnn 


251 


CGGATGTCAA 


ACTCGTTCAC 


301 


CAGATTTCCC 


GTGACGAAAT 


351 


GCCCGATATT 


CCGCCCGCAA 


401 


TTACCGTACC 


GCCCGCCGCG 


451 


ATTGCAACCC 


TGTTTGCCCT 


501 


CGTCATCCTA 


TGA 



nnnnnnnnnn nnnnnnnCCC GAGGCTGTTT 

CGTATCGGCA CGCGCGCCAT CGGCAAGAAA 

CGCCGGCATC CTCAACGGCG GTACAACCCA 

CCGCCGCCAC CCCTGCTGCC GCACCGCAGG 

CCCGCCCGTC AGGATGGGTT CAACTGGACG 



This corresponds to the amino acid sequence <SEQ ID 144; ORF 039>: 

m039 .pep 

1 MPSEPPYASD GI KPDTHEE I PCPPVSAPTA KPVSGSKKPN SMSPXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXP EAVSDVKLVH RIGTRAIGKK 

101 QISRDEIAGI LNGGTTQPDI PPATAATPAA APQVTVPPAA PARQDGFNWT 

151 IATLFAKEVL IMQLSYLVIL * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 145>: 

a039.seq 

1 ATGCCGTCTG AACCGCCTTA CGCCTCAGAC GGCATCAAAC CTGACACACA 

51 CGAGGAAATA CCATGCCCGC CTGTTTCTGC CCCCACTGCA AAACCCGTCT 

101 CTGGGTCAAA GAAACCCAAC TCAATGTCGC CCAAGGCTTC GTCGTCTGCC 

151 AAAAATGCGA AGGAATGTTT AAAGCCAAAG ACCATCTGGC AAGCACGAAA 

201 G AAC C CAT AT TCAACGATT . TGCCCGAAGC TGTTTCGGAT GTCAAACTCG 

2 51 TTCACCGCAT CGGCACGAGC GCCATCGGCA AGAAACAGAT TTCCCGTGAC 

301 GAAATCGCCG GCATCCTCAA CGGCGGCACA ACCCAGCCCG ATATTCCGCC 

351 CGCAACCGCC GCCACCCCTG CTGCCGCACC GCAGGTTACC GTACCGCCCG 

4 01 CCGCGCCCGC CCGTCAGGAT GGGTTCAACT GGACGATTGC AACCCTGTTT 

4 51 GCCCTTATCG TCCTCATTAT GCAGCTTTCC TACCTCGTCA TCCTATGA 

This corresponds to the amino acid sequence <SEQ ID 146; ORF 039.a>; 

a039.pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPKASSSA 

51 KNAKECLKPK TIWQARKNPY STIXPEAVSD VKLVHRIGTS AIGKKQISRD 

101 EIAGILNGGT TQPDIPPATA ATPAAAPQVT VPPAAPARQD GFNWTIATLF 

151 ALIVLIMQLS YLVI L* 



m039/a039 79.4% identity over a 1 70 aa overlap 



10 20 30 40 50 60 

m039 .pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

I I I I I i 1 I I I 1 1 M 1 I 1 1 I I I M ! 1 i I ! I I M I I I ! I I I ! I I ! I 
a039 MPSEPPYASDGIKPDTHEEI PCPPVSAPTAKPVSGSKKPNSMSPKASSSAKNAKECLKPK 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m039 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI 
: : I I I I I I I I I I I II I I I I I I M I I I I I I I I i I i I I I I i I I I I 

a 039 TIWQARKNPYSTIX PEAVSDVKLVHRIGTSAIGKKQISRDEIAGILNGGTTQPDI 

70 80 90 100 110 

130 140 150 160 170 

m039.pep PPATAATPAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 
I I I I M I I I M I I II I I M I I I I I M I I M I I I 1 i ! I I I M I I E ! I t M i I 
a 039 PPATAATPAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 
120 130 140 150 160 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 039 shows 60.8% identity over a 171 aa overlap with a predicted ORF (ORF 039.ng) 
from N. gonorrhoeae: 

m039/g039 

10 20 30 40 50 60 

m03 9 . pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

Mill: IIIIIM h llllh illhlllllhl I I 

g03 9 MPSEPPAASDGIKPTHTEKTSCPPVSVRTAKPASGSKKPSSTSPKASSSAKNAKGCLKPK 

10 20 30 40 50 60 

70 80 90 100 110 120 

m03 9 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI 

I MMMMIhMMIhMMM IIIIIM I 

g03 9 TIWQARKNLYSTIG PKLFRDVKLVHRI GTHAI SKKQMSRDE I AD I LNGGTTLHDT 

70 80 90 100 110 

130 140 150 160 170 

mO 3 9 . pep PPATAAT- PAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLS YLVILX 

II 1 1 II : MIIMhIIII IhhMIIMMIMIIIIIIIII Ml 

g03 9 PPATAAAAPAAAPQVSVPPA RQEGIiNWTIATLFALIVLIMQLSYLFILX 

120 130 140 150 160 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 147>: 

g040 . seq 

1 ATGAACGCGC CCGACAGCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 

51 CATCCGCCAA ATG CGCGGCA CGACACTGGT CGCCGGCATA GAcggCCGCC 

101 TGCTCGAAGG CGGCACCTTA AATAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCCGACT CGTCCTCATC CACGGCGCGT ACCACTTCCT 

201 CGAccgCCTC GCCGCCGCGC AAGgccGCAC GCCGCATTAT TGCCGgggtt 

251 tGCGCGTTAC CGACGaAACc tcGctcgGAC AGGCGCAGCA GtttGCCGGC 

301 AccgTCCGCA GCCGTTTTGA agcCGCATTG tgcggcagCG tttcaggatt 

351 cgcgCGCGCG CCTTCCGTCC CGCTCGTAtc gggcaacttc ctgacCGCCC 

401 GTCcgatggg cgtgattgac ggaACCGata tggaatacgc gggggttatc 

451 cgcaaaaccg ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGATG CCGCCGCTCG GGCATTCCTA CGGCGGCAAA ACCTTCAATC 

551 TCGATATGGT GCAGGCCGCC GCTTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGCCAGCGA AACCCGACGA CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 

751 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGGGCCGCCG ACGGCAGCCT 

801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 

851 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATC 

901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCGTCCTAT TGCACCGCAG 

951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CTCGAACACG 
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1001 ACGGCGACCT GTACGGCTGT GCCGCACTCA AAACCTTTGC CGAAGCCGAT 

1051 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGg 

1101 ctACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGCTGCCCG AAACGCGGCG 

1251 CAAAGACTAC CGCAGCAACG GACGAAACCC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 148; ORF 040.ng>: 

g040.pep 

1 MNAPDSFVAH FREAAPYIRQ MRGTTLVAGI DGRLLEGGTL NKL AADIGLL 

51 SQLGIRLVLI HGAYHFLDRL AAAQGRTPHY CRGLRVTDET SLGQAQQFAG 

101 TVRSRFEAAL CGSVSG FARA PSVPLVSGNF LTARPMGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWM PPLGHSYGGK TFNLDMVQAA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAASETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

301 AAL1RPLEEQ GVLLHRSREY LENHISEFSI LEHDGDLYGC AALKT FAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

4 01 ERGFQTASED ELPETRRKDY RSNGRNPHIL VRRLHR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 149>: 

m040 . seq 

1 ATGAGCGCGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGTCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCTC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

2 51 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAgCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 

4 51 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCTATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGCCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGGCGGGCA AACGCGACGG CTGATTTCGT CCGCCGAACT CTTCACCCGC 

7 51 AACGGCATCG GCACGTCCAT TGCCAAAGAA GCCTTCGTCT CCATCCGGCA 

801 rGCGCAywgG G . CGACATCC CGCACATCGC CGCCCTCATC CGCCCGCTGG 

851 AAGAACAGGG CATCCTGCTG CACCGCAs . c GCGAATACCT CGAAAACCAC 

901 ATTTCCGAAT TTTCCATCCT CGAACACGAC GGCAACCTGT ACGGTTGCGC 

951 CGCCCTGAAA ACCTTTGCCG AAGCCGATTG CGGCGAAATC GCCTGCCTTG 

1001 CCGTCTCGCC GCag.cACAG GACGGCGGCT ACGGCGAACG CnTGCTTGCC 

1051 CAC ATT AT CG ATAAGGCGCG CGGCATAGGC ATAAGCAGGC TGTTCGCACT 

1101 GTCCACAAAT ACCGGCGAAT GGTTTGCCGA ACGCGGCTTT CAGACGGCAT 

1151 CGGAAGACGA GTTGCCCGAA ACGCGGCGCA AAGACTACCG CAGCAACGGA 

1201 CGGAACTCGC ATATTCTGGT ACGTCGCCTG CACCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 150; ORF 040>: 

m040 . pep 

1 MSAPDLFVAH FREAVPYIRQ MRGKTLVAGI DDRLLEGDTL NKLAADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFYLDMLQTA ASAAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAGGQTRR LISSAELFTR 

251 NGIGTSIAKE AFVSIRQAHX XDIPHIAALI RPLEEQGILL HRXREYLENH 

301 ISEFSILEHD GNLYGCAALK TFAEADCGEI ACLAVSPQXQ DGGYGERXLA 

351 HIIDKARGIG ISRLFALSTN TGEWFAERGF QTASEDELPE TRRKDYRSNG 

4 01 RNSHILVRRL HR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 15 1>: 

a040 . seq 
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1 ATGATCGTGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGTTCG CCGCCGACAT CGGGCTTTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCGC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAGCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 

4 51 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCCATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGTA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

7 01 CCGGCGGCGA AACGCGACGG CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 

7 51 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGAGCCGCCG ACGGCAGCCT 

801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 

851 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATT 

901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCATCCTGC TGCACCGCAG 

951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CTCGAACACG 

1001 ACGGCAACCT GTACGGTTGC GCCGCCCTGA AAACCTTTGC CGAAGCCGAT 

10S1 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGG 

1101 CTACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 G CAT AAG C AG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGTTGCCCG AAACGCGGCG 

1251 C AAAG ACT AC CGCAGCAACG GACGGAACTC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 



This corresponds to the amino acid sequence <SEQ ID 152; ORF 040.a>: 

a040-pep 

1 MIVPDLFVAH FREAAPYIRQ MRGKTLVAGI DDRLLEGDTL NKF AADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFHLDMLQTA ASVAVSLQAE 

2 01 KLVYLTLSDG ISRPDGTLAV TLSAQEAQSL AEHAGGETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GILLHRSREY LENHISEFSI LEHDGNLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 

4 01 ERGFQTASED ELPETRRKDY RSNGRNSHIL VRRLHR* 



m040/a040 91.5% identity in 436 aa overlap 

10 20 30 40 50 60 

m04 0 . pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 
I : I 1 I 1 II II I t I : I I 1 M II M II 11 I I I II I M 1 I II I 1 : I I II M I I I I I 1 M I M 
a 04 0 MIVPDLFVAHFREAAPYIRQMRGKTLVAGIDDRLLEGDTLNKFAADIGLLSQLGIRLVLI 

10 • 20 30 40 50 60 



70 80 90 100 110 120 

m040 . pep HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 
! I I I I I 1 1 I I I i I I I 1 i 1 I I I I I II 1 1 I I i ! I I I I I I II I I I 1 I M 1 I I I I 1 I I I I I It I 
a 040 HGARHFLDRHAAAQGRT PHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 

70 80 90 100 110 120 



130 140 150 160 170 180 

m04 0.pep PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 
I II I I I I M I I I I I I I I I I I I II I II I I I I I I II I I I II M I I I I I I I I II I I I II 1 I I I 
a04 0 PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 

130 140 150 160 170 180 



190 200 210 220 230 240 

m04 0 . pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 
I I : I M I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 1 !: M I 
a04 0 TFHLDMLQTAASVAVSLQAEKLVYLTLSDGISRPDGTLAVTLSAQEAQSLAEHAGGETRR 
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190 200 210 220 230 240 



250 260 270 

m04 0. pep LISSA ELFTRNGIGTS IAKEAFVSIRQAHXXDIPHI 

I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I M I 

a04 0 LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI 

250 260 270 280 290 300 

280 290 300 310 320 330 

m04 0 .pep AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS 
I I I II I I I I I I I I M I 1 I I I M I I M I I I I I I I I If I M I I I I I I I II I I I I I I M I M 
a04 0 AALIRPLEEQG I LLHRSRE YLENHI SE FS I LEHDGNLYGCAALKT FAEADCGE I ACLAVS 

310 320 330 340 350 360 

340 350 360 370 380 390 

m040 . pep PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 
I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I | | | | 
a04 0 PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

370 380 390 400 410 420 

400 410 
m04 0 . pep RSNGRNSHILVRRLHRX 
I M I I I I I I II I I I I I I 
a0 4 0 RSNGRNSHILVRRLHRX 

430 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 040 shows 88.3% identity over a 436 aa overlap with a predicted ORF (ORF 040.ng) 
from N. gonorrhoeae: 

m040/g040 



m04 0 . pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 60 

I : I i I I I I I I I I I : I I M I I M I I I I I I I If I I t I I II II t I M II I I I I I I I I I I 
g04 0 MNAPDSFVAHFREAAPYIRQMRGTTLVAGIDGRLLEGGTLNKLAADIGLLSQLGIRLVLI 60 

m04 0.pep HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 120 

III I I I II I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I | | | U | | 
g04 0 HGAYHFLDRLAAAQGRTPHYCRGLRVTDETSLGQAQQFAGTVRSRFEAALCGSVSGFARA 120 

m04 0 . pep PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 180 

I I I M I I I I I I I I I I : I I I I I I I I I I I I I I I I I I M M I II I I I I I I I I : I I I I I I I : || 

g04 0 PSVPLVSGNFLTARPMGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWMPPLGHSYGGK 180 

m04 0.pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 24 0 

II I I I : I :[ I I : M I I I I I I I I I I I I I I I i I I I I 1 I I I I I I I I I I I I M I I I I ::: I I I 

g04 0 TFNLDMVQAAASVAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAASETRR 24 0 

mO 4 0 . pep LI SS A ELFTRNGI GT S I AKEAFVS IRQAHXXDI PHI 27 6 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

g04 0 LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI 300 

m04 0 .pep AALIRPLEEQGILLHRXREYLENHISEFS I LEHDGNLYGCAALKT FAEADCGE I ACLAVS 336 

I I I I I I I I I I I : I I I I I I II M I I I I I J I M M I : I I I I I I I I I I I I I I I I I I I I I I | | 

g04 0 AALIRPLEEQGVLLHRSREYLENHISEFSILEHDGDLYGCAALKTFAEADCGEI ACLAVS 3 60 

m04 0.pep PQXQDGGYGERXLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 3 96 

II I I I I I I I I I I I I 1 I I I I II I I I I I I I I M I I I I I | I I I I I | | | | | M | | | | | || | | 

g04 0 PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 420 

m040.pep RSNGRNSHILVRRLHRX 413 

I I I I I I I I I I I I I I I I 
g04 0 RSNGRN PH I L VRRLHRX ■ 4 37 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 153>: 

g041 . seq 

1 ATGAGTTCGC CCAAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGCCT 

51 GATTACCGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGTGCGCTGG 

101 TGTGCGAAGT ACCGCTGACC GATATGATCC GTTATCCGCT GCTGTCCGCC 

151 GGTTCAAGTT GGACGGACGA ATACGGCAAT CCGCAGAAAT ACGAAGCCTG 

201 CAAACGCCGG CTGGGCGAAT TGTCGCCGTA TCACAATCTT T C AG ACGGCA 

251 TCGATTATCC GCCCGCACTC ATTACCACCA GCCTCAGCGA CGACCGCGTC 

301 CATCCCGCCC ACGCGCTCAA ATTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC AAACTCGCCT GCGTGTTGCT GTTTTTGAAA 

4 51 GAATTTTTGG GATAA 

This corresponds to the amino acid sequence <SEQ ID 154; ORF 041.ng>: 

g041 . pep 

1 MSSPKHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEACKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQRESAD KLACVLLFLK 

151 EFLG* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 155>: 

m041 . seq 

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC G AC AT GAT C C GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAG ACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCCGC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC GAACTCGCCT GCGTCTTGCT GTTTTTGAAA 

4 51 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ID 156; ORF 04 1>: 

m041 .pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSAQSW LYSPDGGGHT GNGTQRESAD ELACVLLFLK 

151 EFLG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 157>: 

a041 . seq 

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATA GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

2 51 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CGCAGCGCGA AGCCGCCGAC GAACTCGCCT GCGTGTTGCT GTTTTTGAAA 

4 51 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ID 158; ORF 041.a>: 

a04 1 .pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQREAAD ELACVLLFLK 
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151 EFLG* 

m04i/a04i 98.7% identity over a 154 aa overlap 

10 20 30 40 50 60 

ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 
M I I I I I 1 I I I I M I I II M I II I I I I I I II I I I I I I I I I I i I M M I I I I I I M I I I I | 
ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 
10 20 30 40 50 60 

70 80 90 100 110 120 

PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVH PAHALKFYAKLRETSAQSW 
I M I I I I I I I I I I I I I I M I I I I I I I I M I I I f I I II I I II I I I I I | i | | | | | | | | j|| 
PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 
70 80 90 100 110 120 

130 140 150 

LYSPDGGGHTGNGTQRESADELACVLLFLKEFLGX 
I I I I I II I I I I I I I M I : ! I I I II I I I I I I I || I | 
LYSPDGGGHTGNGTQREAADELACVLLFLKEFLGX 
130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 041 shows 96.8% identity over a 154 aa overlap with a predicted ORF (ORF 041. ng) 
from K gonorrhoeae: 

m041/g041 

10 20 30 40 50 60 

ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 
: I M : I I I I I I II I I I I I I I M M I M I M I I I I I I I! M I M I I I I I I I I I I I I I M II 
MSSPKHIGLQGGSNGGLITAAAFVRE PQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 
10 20 30 40 50 60 

70 80 90 100 110 120 

PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 
I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 1 i I I I I I I I I I I M Ml 
PQKYEACKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 
70 80 90 100 110 120 

130 140 150 

LYS PDGGGHTGNGTQRESADELACVLLFLKE FLGX 
I I M I I I I M 1 I I I I I I I I I : I I I I I I M I I | | | | 
LYSP DGGGHTGNG TQRE S ADKLAC VLL FLKE FLGX 
130 140 150 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 159>: 

g041-l . seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAAACAA CGACAAGGCG CGCGCACTTT CAGACGGCAT TTTGAATCAA 

151 ATGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CAT C AG AAT G CGGAATATCC GAAGGGCGTG TACCGCATGT 

251 GTACGGCGGC GACCTACCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGATGA GTTGCTCGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAGC AGCCCAACCG CGCGCTGCTG ACTTTGAACA 

4 01 AATCGGGCGG CGATACGGCG TATACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAATTGGTAG AGGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAAG TGTGGCTGGT GGAACGCGGC 



m04 1 . pep 
a041 

m04 1 . pep 
a041 

m041 . pep 
a041 



m041 . pep 
g041 

m041 . pep 
g041 

m041 . pep 
g041 



WO 99/57280 



PCT/US99/09346 



232 



601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 

This corresponds to the amino acid sequence <SEQ ED 160; ORF 041-l.ng>: 

g0 4 1-1 . pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILNQ 

51 MQDTRQIPFC QEHRARMYHF HQNAEYPKGV YRMCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLNKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPAY QIDKGAMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS 

251 SEGGAKPLNL PNDCDWGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADSK 

351 WQEAELPHLP SGALEMTDQP WGG D VV YLAA SDFTTPLTLF ALDLNVMELT 

401 VMRLQPQQFV SDGIEVRQFW AVSSDGERIP YFHVGKNAAP DTPTLVYAYG 

4 51 GFGIPELPHY LGSVGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV RDLSERGMSS PKHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEACKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQRESADKLA CVLLFLKEFL G* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 161>: 

m041-l . seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAGAAAA CGACAAGGCG CGCGCGCTTT CAGACGGCAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAGGACG CGGAGTATCC GAAGGGCGTG TACCGCGTGT 

2 51 GTACCGCGGC GACGTATCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGACGA ATTGCTTGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACTGAGCA 

4 01 AATTGGGCAG CGATACGGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAGTTGGTCG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG AACGAACGCC 

551 AGTTGACCCA ATCGGGCTAT CCGCGCGAAG TATGGCTGGT GGAACGCGGC 

601 AAGAGTTTCG AGGAAAGCCT GCCTGTGTAT CAAATCGGCG AAGACGGCAT 

651 GATGGTGAAC GCGTGGCGTT ATCTCGATCC GCAGGGTTCG CCGATTGATT 

7 01 TGATTGAAGC GTCGGACGGT TTTTACACCA AAACCTATTT GCGGGTCTCA 



AAGAGTTTCG 
GATGGTAAAC 
TGATTGAAGC 
TCCGAAGGCG 
CGGCTATCTG 
GCGCGAACCA 
CGGGGCGAAC 
GGCATTGGAA 
TGGAGAATGT 
TGGCAGGAAG 
CGACCAACCG 
CCACGCCGCT 
GTCATGCGCC 
GCAGTTTTGG 
TCGGCAAAAA 
GGTTTCGGCA 
TTGGCTGGAA 
GAGAATTCGG 
AAAAGCGTTG 
CATGAGTTCG 
TGATTACCGC 
GTGTGCGAAG 
CGGTTCAAGT 
GCAAACGCCG 
ATCGATTATC 
CCATCCCGCC 
CGCAATCTTG 
ACCCAACGCG 
AGAATTTTTG 



AGGAAAGCCT 
GCGTGGCGTT 
GTCGGACGGT 
GGGCGAAACC 
GCGGGACATC 
AAGCTATCCG 
TCGGGGCGGC 
AGCGTGGAAA 
ACAAGGCCGT 
CCGAGTTGCC 
TGGGGCGGCG 
GACGCTGTTT 
TCCAGCCGCA 
GCGGTGTCGT 
CGCCGCGCCC 
TTCCTGAATT 
GAGGGCAATG 
CCCGCGCTGG 
ATGATTTGTT 
CCCAAACACA 
CGCCGCCTTC 
TACCGCTGAC 
TGGACGGACG 
GCTGGGCGAA 
CGCCCGCACT 
CACGCGCTCA 
GCTCTACTCG 
AATCCGCCGA 
GGATAA 



GCCGGCGTAC 
ACCTCGATCC 
TTTTACACCA 
GTTGAACCTG 
TTTTGCTGAC 
AGTGGCGCGT 
GCAGCTTTTG 
CGACCAAGCG 
CTGAAAGCGT 
GCACCTGCCC 
ACGTGGTTTA 
GCGCTGGATT 
GCAGTTTGTT 
CCGACGGCGA 
G AC AC G C C G A 
GCCGCATTAT 
CCTTTGTATT 
CATCAGGCGG 
GGCAGTCGTG 
TCGGCTTGCA 
GTGCGCGAAC 
CGATATGATC 
AATACGGCAA 
TTGTCGCCGT 
CATTACCACC 
AATTCTACGC 
CCTGACGGCG 
CAAACTCGCC 



CAAATCGATA 
GCAGGGTTCG 
AGACGTATTT 
CCTAATGATT 
GCTGCGCAAG 
TGGTGGCGGT 
TTTGCGCCCG 
TTTTGTGGTG 
GGCGGTTTGC 
TCGGGCGCGT 
TCTTGCCGCC 
TGAACGTGAT 
TCAGACGGCA 
ACGCATTCCT 
CCTTAGTCTA 
CTGGGCAGCG 
GGCAAACATC 
CGCAGGGAAT 
CGTGATTTGT 
GGGCGGCAGC 
CGCAAAGCAT 
CGTTATCCGC 
TCCGCAGAAA 
ATCACAATCT 
AGCCTCAGCG 
CAAACTGCGC 
GCGGCCATAC 
TGCGTGTTGC 



AAGGCGCGAT 
CCGATTGATT 
GCAGGTGTCG 
GCGATGTGGT 
GACTGGCACC 
GAAACTGAAT 
ATGAAACGCA 
GCAAGCCTGC 
CGACAGCAAA 
TGGAAATGAC 
AGCGATTTCA 
GGAACTGACC 
TCGAAGTGCG 
TATTTCCACG 
TGCTTACGGA 
TCGGCAAATA 
CGCGGCGGCG 
CAGCAAACAC 
CCGAACGCGG 
AACGGCGGCC 
CGGTGCGCTG 
TGCTGTCCGC 
TACGAAGCCT 
TTCAGACGGC 
ACGACCGCGT 
GAAACCTCGC 
CGGCAACGGC 
TGTTTTTGAA 
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"7 51 GCCGAAGGCG AGGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTGGT 

801 CGGCTATCTG GCGGGGCATC TTTTGCTGAC GCTGCGCAAG GACTGGAACC 

851 GCGCGAACCA AAGCTATCCG AGCGGCGCGC TGGTGGCGGT GAAGCTGAAT 

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCGAGCCTGT 

1001 TGGAGAACGT ACAAGGCCGT CTGAAAGCAT GGCGGTTTGC CGACGGCAAA 

1051 TGGCAGGAAG TCGAATTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCT TGGGGCGGCG ACGTGGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

1401 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 

1451 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 

1501 AAAAGCGTTG AT GAT TT ATT GGCAGTCGTG CGCGATTTGT CCGAACGCGG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGCGCGCTG 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 

1701 CGGTTCAAGC TGGACAGACG AATACGGCAA TCCGCAAAAA TACGAAGTCT 

1751 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1801 AT C GAT TAT C CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT 

1851 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCCG 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACCCAACGCG AATCCGCCGA CGAACTCGCC TGCGTCTTGC TGTTTTTGAA 

2001 AGAGTTTTTG GGCTAA 

This corresponds to the amino acid sequence <SEQ ID 162; ORF 041 -1>: 

m041-l.pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLENDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKLGSDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW NERQLTQSGY PREVWLVERG 

201 KSFEESLPVY QIGEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLRVS 

251 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWNRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADGK 

351 WQEVELPRLP SGALEMTDQP WGGDWYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

451 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV RDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSAQSWLYS PDGGGHTGNG 

651 TQRESADELA CVLLFLKEFL G* 

m041-l/g041-l 94.6% identity in 671 aa overlap 

10 20 30 40 50 60 

m041-l.pep MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 
I I I I I M I II I I I I I I I I ! II I I I I I I I M I ! I I I : I 1 i I I II II If I I : I I I I I I II I 
g041-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILNQMQDTRQIPFC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m041-l.pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLG DDVYLGGVSH 
I I I I I I I I M I I : I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g041-l QEHRARMYHFHQNAEYPKGVYRMCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m041-l.pep LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGG FHFPAGKNHVSWRDENSVWVCPAW 
I J I I I I I I I I II : I I : I I I I I I I I I I I I I I I I I I | I I I I I I I I I I II I I I I I I I I I I I I 
g041-l LVEQPNRALLTLNKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDEigSVWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m041-l.pep NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGS PIDLIEASDG 
: II II I : I I I I I I M II I I I I I I I I I I I : I I I : : I I I I I I I I I I I I I I I I I I I I I I I | 
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g04l-l DERQLTESGYPREVWLVERGKSFEESLPAYQIDKGAMMVNAWRYLDPQGSPIDLIEASDG 

190 200 210 220 230 240 

250 260 270 280 290 300 

m04 1-1 . pep FYTKTYLRVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 

I I I I t I I : M : I I I I I I I M I! I! I II I M I I I I I I M I II : I I I I I I I I M I I I II I I 
g041-l FYTKTYLQVSSEGGAKPLNLPNDCDVVGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 

250 260 270 280 290 300 

310 320 330 340 350 360 

m04 1-1 . pep RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 
M I I I M I I II I I I I I ! I I I I I I M I I I I M I I I I I M I I I II I I I I I : II I I : I I I : I I 
g04l-l RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADSKWQEAELPHLP 

310 320 330 340 350 360 

370 380 390 400 410 420 

m04 1-1 .pep SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 

II I 1 I It I I II M 1 I I I i I M I I 1 I li I I I II II I 1 I M M M Mill I I I I : I : I I i 
g0 41-l SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRLQPQQFVSDGIEVRQFW 

370 380 390 400 410 420 

430 440 450 460 470 480 

m04 1-1 .pep TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 
: : I : t I M I I I I H I 1 I I I M I I ! I M I I 1 I I I 11 I I I I !■ II : II I M I M t I ! I III I 
■ g041-l AVSSDGERIPYFHVGKNAAPDTPTLVYAYGGFGIPELPHYLGSVGKYWLEEGNAFVLANI 

430 440 450 460 470 480 

490 500 510 520 530 540 

m041-l.pep RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF 
I! II I M i I I I I I I I i I I I I i I I i M I I ) I I I I i I I I : 1 I I : I I I M I I i I i I I I I I I M 
g041-l RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGMSSPKHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540 

550 560 570 580 590 600 

m04 1-1 . pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
I M I I I I I I II I I M I I I I I I I I II I I I I I I I I I I I I I I I I I : I I I I I I I II I I I I I I I I 
g041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEACKRRLGELSPYHNLSDG 

550 560 570 580 590 600 

610 620 630 640 650 660 

m041-l .pep IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 
I I I I I I I I I I I II I I I I I I II I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I : I I 
g041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQRESADKLA 

610 620 630 640 650 660 

670 

m041-l .pep CVLLFLKEFLGX 
I I I I M I I I I I I 
g041-l CVLLFLKEFLGX 

670 

m041-l/P55577 

sp| P55577 | Y4NA_RHISN PROBABLE PEPTIDASE Y4NA >gi 12182536 (AE000086) Y4nA [Rhizobium sp . 
NGR234) Length « 726 
Score = 370 bits (940), Expect = e-101 

Identities = 217/682 (31%), Positives = 331/682 (47%), Gaps = 22/682 (3%) 

Query: 2 KSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFCQ 61 

K DP + +D + + N T + ++ + L LQ T +1 

Sbjct: 42 KDASDPRAYLNEIDGDKAMTWVEAHNLSTVDKLSKDPRYSEYQADALTILQATDRIASPS 101 

Query; 62 EHRARMY-HFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 120 

R M +F QD + +G++R T +YRSG P+W+ + V + G G 
Sbjct: 102 FARDGMI DNFWQDGTHVQGLWRRTTWESYRSGNPQWRTILDVDALSKAEGKTWVFEGGDC 161 



Query: 121 LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSKRDENSVWVCPAW 180 
L N L+ LS G D E D+ GE V+ GF P GK V+W DEN+++V W 
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Sbjct: 162 LPPTSNLCLIRLSDGGKDADVVREFDIAKGEFVKEGFVLPEGKQSVTWVDENTIYVTREW 221 



Sb j ct : 


162 


Query : 


181 


Sbjct: 


222 


Query: 


233 


Sbjct : 


282 


Query : 


292 


Sbjct: 


338 


Query : 


348 


Sbjct : 


398 


Query: 


408 


Sbjct: 


458 


Query: 


465 


Sbj ct : 


518 


Query: 


525 


Sbjct: 


578 


Query: 


585 


Sbjct: 


637 


Query: 


645 


Sbjct: 


697 



++T SGY 



FY 



+V+RG+S ++++ +++ 



-IGEDGMM — VNAWRYLDPQGSPI 232 
E G++ ++ +D + 



+ L LP GY G + L+ DW A + + + 
PDTRKVVLPLPTTAVFSGYYKGQAIYWLKSDWTSAKGTVFHN 337 



GA++A L 



G W 



+L 



A+ + 



L +T 



-LFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFA 34 7 
LF P+E Q++ TK +V S+L NV ++++ F 



P TLF 



+FD+ G+ QQFW TS DG ++PYF V + 



GK WLE+G A+ LANIRGGGEFGP+WH A 



PT++YAYGGF IP P Y 



++ + DD AV +DL + ++S H+ 



G+ GGSNGGL+ 



++ p 



A+V +VPL DM+ + +SAG+SW EYG+P 



L +SPYHN+ G+ YP 



GH +E A 



TS DDRV P HA K 



+ Y 



identified in N. meningitidis <SEQ ID 163>: 



The following partial DNA sequence was 

a041-l . seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAAACAA CGACAAGGCA CGCGCATTGT CTGACGGCAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAAAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

2 01 GTACCATTTC CAT CAAGAT G CGGAATATCC GAAAGGCGTG TACCGCGTGT 

251 GTACCGCGGC GACTTACCGT TCGGGCTATC CT G AG T G G AA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGACGA ATTGCTCGGT GACGATGTAT ATCTAGGCGG 

351 CGTGTCGCAC CTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACT GAG C A 

4 01 AATCGGGCGG CGATACCGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAGTTGGTAG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAGG TGTGGCTGGT GGAACGCGGC 

601 AAGAGTTTCG AGGAAAGCCT GCCGGTGTAC CAAATTGCTG AAGACGGCAT 

651 GATGGTGAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 

7 01 TGATTGAAGC GTCTGACGGT TTTTACACCA AAACCTATTT GCAGGTCTCA 

7 51 GCCGAAGGCG AAGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTAGT 

801 CGGCTATCTG GCCGGACATC TTTTGCTGAC CTTGCGTAAA GACTGGCACC 

851 GCGCGAACCA AAGCTATCCG AGTGGCGCAT TGGTAGCAGT AAAATTAAAC 

901 CGCGGCGAAT TGGGCGCGGC GCAGCTTTTG TTTGCGCCCA ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTCGTG GCGAGCCTGC 

1001 TGGAAAACGT ACAGGGTCGT CTGAAAGCGT GGCGTTTTAC TGATGGCAAA 

1051 TGGCAGGAAA CCGAGTTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAAGCG TGGGGGGGCG ACGTAGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCATTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 
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14 51 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 

1501 AAAAGCGTTG ATGATTTATT GGCAGTCGTG AGCGATTTGT CCGAACGCGG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT AGGCGCGCTG 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 

1701 CGGTTCAAGC TGGACAGACG AATACGGCAA TCCGCAAAAA TACGAAGTCT 

17 51 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT AT C AC AAT CT TTCAGACGGC 

1801 ATCGATTATC CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT 

1851 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCGC 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACGCAGCGCG AAGCCGCCGA CGAACTCGCC TGCGTGTTGC TGTTTTTGAA 

2 001 AGAGTTTTTG GGCTAA 



This corresponds to the amino acid sequence <SEQ ID 164; ORF 041-l.a>: 

a041-l.pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPVY QIAEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS 

2 51 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPNETQALE SVETTKRFW ASLLENVQGR LKAWRFTDGK 

351 WQETELPRLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

451 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV SDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQREAADELA CVLLFLKEFL G* 



a041-l/m041-l 97.9% identity in 671 aa overlap 



10 20 30 40 50 60 

aO 4 1-1. pep MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILAQLQDTRQIPFC 
I I I I I I I I I I i II I I I I I I I I I I I I I I i I I I I I I i : I I If I I I! M I I J I i I I M I I I I I 
m041-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 

10 20 30 40 50 60 



70 80 90 100 110 120 

a041-l . pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 
I I I I I I I I II I I I M I I I I I I M I I I I M I I I II I I I II I I I I I I M I I I I I I M I I I M 
m041-l QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 



130 140 150 160 170 180 

a04 1-1 . pep LVEQPNRALLTLSKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I M I I I I I I I I II I I 
m041-l LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

130 140 150 160 170 180 



190 200 210 220 230 240 

a041-l .pep DERQLTESGYPREVWLVERGKSFEESLPVYQIAEDGMMVNAWRYLDPQGS PIDLIEASDG 
: I I M I : M H II t M II N I M I II I M M ! : I 11 M M 1 t I I II I M I U { I II M I 1 
m041-l NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGS PIDLIEASDG 

190 200 210 220 230 240 



250 260 270 280 290 300 

a041-l.pep FYTKTYLQVS AEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 
I I ! I I M : I I II I I I I! M II I I ! I I I ! I i I I I I I I I I! II I : I M I I M ! I I I I I! I I I 
m041-l FYTKTYLRVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 

250 260 270 280 290 300 



310 320 330 340 350 360 

a041-l .pep RGELGAAQLLFAPNETQALESVETTKRFVVASLLENVQGRLKAWRFTDGKWQETELPRLP 
I I I M M 1 M t t I : M 1 M I I I t II I I II I I II I 1 I I I I I I I M M : I I I I I 1 : I I ! I I I 
m041-l RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 

310 '320 330 340 350 360 
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370 380 390 400 410 420 

a041-l .pep SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M If I I I I I I I M M I I I M I I I 
m041-l SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 

370 380 390 400 410 420 

430 440 450 460 470 480 

a041-l.pep TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 
I I I M I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I | | | | | | | | | | | | M f 
m041-l TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGI PELPHYLGSIGKYWLEEGNAFVLANI 

430 440 450 4 60 470 480 

490 500 510 520 530 540 

aO 4 1-1. pep RGGGEFGPRWHQAAQGISKHKSVDDLLAVVSDLSERGISSPEHIGLQGGSNGGLITAAAF 
M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | || 
m041-l RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540 

550 560 570 580 590 600 

a041-l.pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
M I I I I I I I I M I I I I I I I I I I I I I I I I I | I | M M M | | M I I I M I | | | | | | | | M | | 
m041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 

550 560 570 580 590 600 

610 620 630 640 650 660 

a041-l .pep IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQREAADELA 
I I M I I I II I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I II : I I I II 
m041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 

610 620 630 640 650 660 



670 

a041-l.pep C VLLFLKE FLGX 
I I I I I I I I I I I I 
mO 4 1 - 1 CVLLFLKEFLGX 

670 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 165>: 

g042 . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAagcG TTCGTGCCGC ATACCAGCGC 

51 GTTATCCAAC ACTTCCACGG CAGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TGCGGTCGAT GATGAAAATC CAGCCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACGG GCTGCCCGTG CCCTTCGTTG CGTAAAGATT CGTCCACGGG 

2 01 CGGCAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GATTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCTGTAACCG AC AG C AC C AG CCCGCGTCCT 

301 TTGCCTTTGG cggCTTCGCG CTTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 ATTCAATTCC GCCACGCGCG CTTCCTTACC GAAAATCCGC GACAGGGTCT 

4 01 CCATCTGCTT CTCGCCGCTG GTGCGGATAT TGCCGTTGTC C AC C GT C AAA 

4 51 TCTATGgtgG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCACCCGG 

501 CCCGCCGGTA ATGACAAACT GCGGATTGTG GCGGTGCAGG GATTCGCAAT 

551 CGGGCTCAAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 

601 AAATAG 

This corresponds to the amino acid sequence <SEQ ID 166; ORF 042.ng>: 

g0 42 .pep 

1 MTMICLRFQA FVPHTSALSN TSTAAGPSCP MAAVRSMMKI QPGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRFWA NSASICAFNS ATRASLPKIR DR VSICFSPL VRILPLSTV K 

151 SMVVAFFANC SYASAPGPPV MTNCGLWRCR DSQSGSNSVP TVAALSNAGC 

201 K* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 167>: 
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m042 . seq 




1 


ATGACGATGA 


51 


GTTATCCAmT 


101 


TACGGTCGAT 


151 


AAGGAAACAG 


201 


CGGTAGGCCG 


251 


CGAAGGCGGA 


301 


TTGCCTTTGG 


351 


CTTCAATTCC 


401 


CCATCTGCTT 


451 


TCTATGGTGG 


501 


CCCGCCGGTA 


551 


CGGGCTCGAA 


601 


AAATAA 



TTTGCTTGCG CTTCCAAGCG 
ACTTCGACAG CCGcCGGCCy 
GATGAAAATC CAATCGGGGT 
GCTGCCCGTG CCCCTCGTTG 
ATGTCGCCGT GTATCCAACT 
CACCTTGTTG CCCGTAACCG 
CGGCTTCGCG CGTTTGGGCG 
GCCGCGCGCG CTTCCTTGCC 
TTCGCCGCTG GTGCGGATAT 
TCGCGTTTTT CGCTAACTGT 
ATGACAAGCT GAGGATTGTA 
CAGCGTCCCC ACCGTTGCCG 



TTCGTGCCGC GTACCAGCGC 
TTCyTGCCCG ATGGCGGCGG 
TTTTCTCTTT GATGTATTCG 
CGTAAAGATT CGTCTACAGG 
TGCCAACCGC GACTGCGTGC 
ACAGCACCAG CCCGCGTCCT 
AACAGCGCGT CAATCTGCGC 
GAAAATCCGC GCCAAGGTCT 
TGCCGTTGTC CACCGTCAGA 
TCATACGCTT CCGCGCCCGG 
GCGGTGCAGG GCTTCGTAAT 
CCTTGTCAAA TGCAGGCTGC 



This corresponds to the amino acid sequence <SEQ ID 168; ORF 042>: 

m042 -pep 

1 MTMICLRFQA FVPRTSALSX TSTAAGXSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTV R 

151 SMVVAFFANC SYASAPGPPV MTSXGLXRCR ASXSGSNSVP TVAALSNAGC 

201 K* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 169>: 

a042 . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

3 51 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 
4 51 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 
501 CCCGCCGGTA ATGACAAGCT GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 
551 CGGGCTCGAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 
601 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 170; ORF 042. a>: 

a042 .pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTV R 

151 SMVVAFFANC SYASAPGPPV MTS*GL*RCR AS*SGSNSVP TVAALSNAGC 

201 K* 



m042/a042 99.0% identity over a 201 aa overlap 



10 20 30 40 50 60 

m04 2 .pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I f I M I I I I I I I I I I I M I I I I I I I I I I II I I M I I I I I I I I I i II I | I M I ! I II I 
a04 2 MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 2.pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I M I I II I II I M I I I I I I I I I I I I I I I I I I I I I M I I I II I I I M I I I I I I I I I M | | 
a04 2 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

70 80 90 100 110 120 



m04 2 . pep 



130 140 150 160 170 180 

AARASLPKI RAKVS I C FS PLVRI LPLSTVRSMVVAFFANCS YAS APG PPVMTSXGLXRCR 
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M M II It I I I I I I I I I II I I I I I I I 1 I I I I I j I | | M I I I I I I I I I I I I I I I M I i I I I 
a04 2 AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 

130 140 150 160 170 180 

190 200 
m04 2 . pep ASXSGSNSVPTVAALSNAGCKX 
I I II I I ! I I II I I I I I I II I I I 
a04 2 ASXSGSNSVPTVAALSNAGCKX 

190 200 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 042 shows 93.0% identity over a 201 aa overlap with a predicted ORF (ORF 042.ng) 
from N, gonorrhoeae; 

m042/g042 

10 20 30 40 50 60 

m042 .pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
M I I II I I I I I I I : I I I M I I I I I I I I M I I I M I M I I I I I II I I I I i I I I I I I I I 
g04 2 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 2 .pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I ! I I I I I I II I I | M I I I I I I I 
g04 2 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 2.pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 
1:11111111 : I M I I I I I I I I I I I I I I : M I I I i M I I I I I I I II I I I I I : II III 
g04 2 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMWAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

190 200 
m04 2 . pep ASXSGSNSVPTVAALSNAGCKX 
I I I I I I I I I I I I I I I ! I I II 
g04 2 DSQSGSNSVPTVAALSNAGCKX 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 171>: 

m04 2-1 . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCTAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ID 172; ORF 042-1 >: 

m042-l .pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 
101 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTV R 
151 SMVVAFFANC SYASAPGPPV MTS* 



m042-l/g042 95.4% identity in 173 aa overlap 
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10 20 30 40 50 60 

m04 2-l .pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

I I I I I M I I I I II : I I I M I I I I I I I I I I I I I I | I U | I I I I I I I I II I I II I 

g04 2 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 2-l .pep RKDS STGGRPMS PC I QLANRDCVPKADTLLPVTDSTS PRPLPLAAS RVWANSAS I CAFNS 
M I I 1 I I I I I I I II I II I II I I I I I I I f I i I I I I II I I I I I I I | | | | MINIMUM 
g04 2 RKDS STGGRPMS PC I QLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSAS I CAFNS 

70 80 90 100 110 120 

130 140 150 160 170 

m04 2-l .pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX 

I : I I I I I I I I : I I i I I I I I I I I I I t M I : I I I I I I I ! I I M II M I I I I I I : 
g04 2 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMVVAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

g04 2 DSQSGSNSVPTVAALSNAGCKX 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 173>: 

a042-l .seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT AC T T CG AC AG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ID 174; ORF 042-l.a>: 

a042-l .pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVR SMMK I QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 
101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR 
151 SMVVAFFANC SYASAPGPPV MTS* 



m042-l/a042-l 100.0% identity in 173 aa overlap 

10 20 30 40 50 60 

m042-l.pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I M I I I I I I II I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a04 2-l MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m042-l . pep RKDS STGGRPMS PCI QLANRDCVPKADTLLPVTDSTS PRPLPLAASRVWANS AS ICAFNS 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | M | | | M I I I I I I I I I I I M | | 1 
a042-l RKDS STGGRPMS PC IQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

70 80 90 100 110 120 



130 140 150 160 170 

m042-l . pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSX 

I I I I I I I I I I I I I I I I I I | | | | | | | | M I I I I I M I I I I 

a042-l AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 175>: 
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g04 3 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GTCGGCCCAT CAGCACTTTT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GCCTGAACGC TTCgttgaAC 

101 CGTCCCGCGT ggcggtagcc gcAAAAGTGC ATcGCGGCTT GGATGGTGCT 

151 GCCCGATTCG ATGAGGGcga gcGCGTGTTC CAGCCGCAGG CGGCGCAGGC 

2 01 GTCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

2 51 CATTCGTTCA GCCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGGCG 

301 GGCGAATTCG CTGTTCAAAA TATCGGCGGC TTCGTCTATG CGCCGGCGGC 

351 GGTAGCCGTT GTCGTGGCGG CGGAAGGTGA AGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 176; ORF 043.ng>: 

g043 . pep 

1 MVVSNQNIYA VGPSALFHIR RQKSVMPPER FVEPSRVAVA AKVHRGLDGA 

51 ARFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQPDA AGDFGDGQRA 

101 GEFAVQN IGG FV Y A P AAV A V VVAA EGEA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 177>: 

m04 3 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 

151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAgGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

2 51 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG T C AG CG G AC G 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 

This corresponds to the amino acid sequence <SEQ ID 178; ORF 043>: 

m04 3 .pep 

1 MVVSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQS DA AGDFGDGQRT 

101 GEFVLQDVGG FVYAPTAVTV WAAEGEAQ* 

Computer analysis of the amino acid sequences gave the following results: 

Homology with a predicted ORF from N. meningitidis menA with menB 

ORF 043 shows 89.8% identity over a 128 aa overlap with a predicted ORF (ORF043.a) from 

N. gonorrhoeae: 

m043/g043 

10 20 30 40 50 60 

m04 3 . pep MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I I I M I I I I I : I I II I : i I I i I M I I I I II I I I I I M I I I I ! I Mill! MINIM 
g043 MWSNQNIYAVGPSALFHIRRQKSVMPPERFVEPSRVAVAAKVHRGLDGAARFDEGERVF 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 3 . pep Q PQAAQASGDG FAG LRFE I A FQVAFVQS DAAGDFGDGQRTGEFVLQDVGG FVYAPTAVTV 

I I I M II I I II I I I M II I I I I ! I I I I I I I I M I I I ! I M I I : : I : M I I I II I : I I : I 
9043 Q PQAAQASGDG FAG LRFE I A FQVAFVQPDAAGDFGDGQRAGEFAVQN I GGFVYAPAAVAV 

"70 80 90 100 110 120 

130 

m0 4 3 . pep VVAAEGEAQX 

I I I I I I I I 

g04 3 VVAAEGEAXX 

130 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 79>: 

a043 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 



WO 99/57280 



PCT/US99/09346 



242 



151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAGGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

251 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 



This corresponds to the amino acid sequence <SEQ ID 180; ORF 043.a>: 

a04 3 . pep 

1 MVVSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 GEFVLQD VGG FVYAPTAVTV WAA EGEAQ* 



m043/a043 100.0% identity in 129 aa overlap 



10 20 30 40 50 60 

m04 3 . pep MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I I M I t M I I I I I I I I I M I I I I 11 I M I I II M M I I I I I I I I I i I I I I I I t I I II II I 
a04 3 MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 

10 20 30 40 50 60 



70 80 90 100 110 120 

m0 4 3 - pep Q PQAAQAS G DGFAGLRFE I AFQVAFVQSDAAGDFGDGQRT GEFVLQDVGG FVYAPTAVTV 

I I I I I I I I I I I I I I I I M I I II I I I I I M I II II I I I I I I I I I I I I I I I I I I I ! I I I I I I 
a04 3 Q PQAAQAS G DGFAGLRFE I AFQVAFVQSDAAGDFGDGQRTGEFVLQDVGG FVYAPTAVTV 

70 80 90 100 110 120 



130 

mO 4 3 . pep VVAAEGEAQX 
I I II I I I I I I 
a04 3 VVAAEGEAQX 

130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 181>: 

g044 . seq 

1 ATGCTGCCCG ACCAGAGCGT CGAGTTCTTG CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGCCC TTCGATAACG GCGGTCAGCT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 CGGCTGCCGT AGCGCATTAA 



This corresponds to the amino acid sequence <SEQ ID 182; ORF 044.ng>: 

g044 .pep 

1 MLPDQSVEFL PQ VWFPGLF GGGFP AVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQP FDNGGQLHAV VGGLRFAAEK F F F AAA V AH * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 183>: 

m044. seq 

1 ATGCCGTCCG ACTAGAGCGT CGAGTTCTTT CCAGAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGAGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCATTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCAGTCAGTT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTACCGT AGCGCAyTAa 



This corresponds to the amino acid sequence <SEQ ID 184; ORF 044>: 

m044 .pep 

1 MPSDXSVEFF PE VWFDGLF GGGFPAVAL P TVYPVFHAIF DVLRVGADDD 
51 GAAAFERFQS FDDGSQFHAV VGGLRFAAEK FFFVATVAH* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 185>: 

a044 . seq 

1 GTGCCGTCCG ACCAGCGCGT CGAGTTCTTT CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCGGTCAGTT 

201 CCATACGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTGCCGT AGCGCATTAA 

This corresponds to the amino acid sequence <SEQ ID 186; ORF 044.a>: 

a044 . pep 

1 VPSDQRVEFF PQ VVVFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQS FDDGGQFHTV VGG LRFAAEK FFFVAAVAH * 

m044/a044 91.0% identity over a 89 aa overlap 

10 20 30 40 50 60 

m04 4 . pep MPS DX S V E F F P EVW F DG L FG GG F P A V AL PT V Y P V FH A I FD V LRVG ADDDG AAA FE R FQ S 

: I I I I I I I i : I M M M M M 1 M I I ! I I II M II I : I M I M M I I M I I M M i M 
a04 4 VPSDQRVEFFPQWVFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQS 

10 20 30 40 50 60 

70 80 90 

m04 4 . pep FDDGSQFHAWGGLRFAAEKFFFVATVAHX 
I M I : I I f : I I I I it I II M I I I I I : I I I I 
a04 4 FDDGGQFHTWGGLRFAAEKFFFVAAVAHX 

70 80 90 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 044 shows 86.5% identity over a 89 aa overlap with a predicted ORF (ORF 044.ng) 
from N. gonorrhoeae: 

m044/g044 

10 20 30 40 50 60 

m044 . pep MPSDXSVEFFPEWVFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 

I I I I I I : I • 1 1 1 I I I M 1 1 1 1 I I I 1 1 I II 1 1 1 1 I h I I 1 1 1 I I I I I I I 1 1 II I I I I 

g 0 4 4 MLPDQSVEFLPQWVFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQP 

10 20 30 40 50 60 

70 80 90 

m04 4 . pep FDDGSQFHAWGGLRFAAEKFFFVATVAHX 

I I • h I - M ! M M I II M M M • I • 1 1 1 1 

g044 FDNGGQLHAWGGLRFAAEKFFFAAAVAHX 

70 80 90 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 187>: 

g046 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGC CCGCCGCgcc gCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC gaATATGGAA AGGCTGCCGt TTTcGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TtcgctGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGagaGCGCG AGcagcaagt cggcatcttC 

351 CgcgccggcG Cgttataatg tgAAGGGGGA TGCGccgttg ccgaAAACGG 

4 01 TTTGGacatc gaggcggctg CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

451 TCGATAAcgg TTACGTCGTT GTTGGTGATG GCGGCAAGGT TTTGCGCGAC 
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501 GGTAGAACCT ACCTGCCCGT TGCCTAAAAT GAGGATTTTC ACGGTATGGG 

551 TCGCCGGGTG A 

This corresponds to the amino acid sequence <SEQ ID 188; ORF 046.ng>: 

g046 -pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RYNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 S ITVTSLLVM AARFCATV EP TCPLPKMRIF TVWVAG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 189>: 

m046 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAsT CCGC.r.sGC gCGcCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

2 01 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 
251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

3 01 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

3 51 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

4 01 TTTGGACATC GAGGCGGCTG C CTGTTT C CT GCAATGCTTT TTCGTCGATG 
451 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 
501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 
551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 190; ORF 046>: 

m046 .pep 

1 MSAMLRPTSX PXXRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 S I TVTSLLGM AARFCATV EP TCPLPKMRIF TVWVAE* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 191>: 

a046 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGT CCGCCGCGCC GCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

4 01 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

4 51 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 

501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 

551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 192; ORF 046.a>: 

a04 6.pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE* 

m046/a046 98.4% identity over a 186 aa overlap 

10 20 30 40 50 60 

m04 6 .pep MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
MINIMI 1 I M M M M M M I M 1 M M M M M M M M 1 M M M I I M M I 
a04 6 MSAMLRPTSS PPRRACMMTI RTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m04 6.pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 
I I I I I I I I I I I I 1 I I I I I I M I I I I I I I I M i I I M I I I I I I I I I M II II M I I I I I I I 
a04 6 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 6 . pep RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 
I I II I i I I I I I I I I II I I I I I I I I I I I I I I I I M It I M I I I I M I I I I I I I I I I M I I I 
a04 6 RSNVKGDAPLPKTVWT SRRLPVSCNAFS SMS I TVTSLLGMAARFCATVE PTCPLPKMRI F 

130 140 150 160 170 180 



m04 6.pep 
a046 



TVWVAEX 
I I I I I I I 
TVWVAEX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 046 shows 97.3% identity over a 1 85 aa overlap with a predicted ORF (ORF 046.ng) 
from N. gonorrhoeae: 

m046/g046 

10 20 30 40 50 60 

m046 .pep MSA^RPTSXPXXRAC^TIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 

1 1 1 1 1 1 1 1 1 I MMIIMIIIMIIMIIIIIIIMIIIIIMIIIMIIIMMII 

g046 MSAMLRPTSSPPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 6 . pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

MM Mill III II M Ml IM I II MIMI IMI IMI II M I M M I I M M IMMM 

g04 6 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

70 80 90 100 110 120 

130 . 140 150 160 170 180 

m04 6 . pep RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

I MllimMHMIMMMMMIMMMMII M 1 1 1 1 ! M 1 1 M 1 1 1 M 1 1 1 

g04 6 RYNVKGDAPLPKTVWTSRRLPVSOTAFSSMSITVTSLLVMAARFCATVEPTCPLPKMRIF 

130 140 150 160 170 180 



m04 6.pep TVWVAEX 

Mill 

g04 6 TVWVAGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 193>: 

g047 . seq 

1 ATGGTCATCA TACAGGCGcg gcGCGGCGGG CTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 AT CGAAGG CG ACGAAATCCT GTTTGCCGCC GCCGCCGAAA ACATCGGGGC 

201 GGTCATACCc gaATTGCGCC CCAAAGAAAC CAGCACCCGC CG CAT C ATG A 

251 TTGCCGGCGG CGGCAACATc tgctACCGCC TCGCCAAGCA GCTCGAACAC 
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301 GCATAcaacG TCAAAATCAT CGAATGCCGG CCGCGCcgtg ccgaATGGAT 

351 AGCCGAAAAC ctcgAcaaCA CCCTCGTCCT GCAAGGTTCG Gcaaccgacg 

4 01 aAaccctgct cgAcaacgaa tacatcgacg aaatcgaCGT ATTCTGCGCC 

4 51 CTGACCAACG ACGACGAAAG CAACATTAtg tCCGCCCTTT TGGCGAAAAA 

501 CCTcggcgCG AAGCgcgtca tcggCATCGT CAACCGCTCA AGCTACGTCG 

551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 

6 01 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 

651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CAT CGAAG TC GTCGCGCACG 

701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGC AT CAG CGGCATCAAA 

751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

801 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGTGACCACA 

851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAGAAACTC 

901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 194; ORF 047.ng>: 

g047 -pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP EL.RPKETSTR RIMIAGGGNI CYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDE1DVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNTRS SYVDLLEGNK ID IWSPHLI 

201 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGET I M GHHTETVIQD GDKIIFFVSR RRI LNELEKL 

3 01 IQVKMGFFG* 



The following partial DNA sequence 



m047 . seq 






1 


ATGGTCATCA 


TACAGgCGcG 


51 


TGCCGACATC 


GCCCAAGATT 


101 


CCGTTTACCG 


CAACAACCGC 


151 


AT CGAAGGCG 


ACGAAATCCT 


201 


GGTCATACCC 


GAATTGCGCC 


251 


GcATCATGAT 


TkCCGGCGGC 


301 


CTCGAACACG 


CATACAACGT 


351 


CGAATGGATA 


GCCGAAAACC 


401 


CAACCGACGA 


AACCCTGCTC 


451 


TTCTGCGCCC 


TGACCAACGA 


501 


GGCGAaAAAC 


CTCGGCGCGA 


551 


GCTACGTCGA 


TTTGCTCGAA 


601 


CACCTCATCA 


CCATCGGCTC 


651 


CGTTGCCGTC 


CACCCCATCC 


701 


TCGCACACGG 


CGACAAAAAA 


751 


GGCATCAAAT 


GGCCCGAAGG 


801 


AACCGG CGAA 


AC C ATTATGG 


851 


GCGACCACAT 


CATCTTTTTC 


901 


GAAAAACTCA 


TCCAGGTCAA 



identified in N. meningitidis <SEQ ID 195>: 

C . . syGCGGA STGCTTGTCG GACGCAGCAT 
TGCCCGACGG GGCCGACTGC CAAAT CTGCG 
CTCATCGTCC CCGCGCCGCA AACCGTCATC 
ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 
CCAAAGAAAC CCAAAGAAAC CAGCcCmgtnm 
GGCAACATCG GCTACCGTCT CGCCAAGCAG 
yAAAAT CATC GAATGCCGGC CGCGCCGTGC 
TCGACAACAC CCTCGTCyTG CAAGGTTCGG 
GACAACGAAT ACATCGACGA AATCGACGTA 
CGACGAAAGC AAC AT TATGT CCGCCCTTTT 
AGCGCGTCAT CGGCATCGTC AACCGCTCAA 
GGCAACAAAA TCGACAT CGT CGTCTCCCCC 
GATACTCGCC CACATCCGGC GCGGCGACAT 
GGCGCGGCAC GGCGGAAGCC AT CGAAGTCG 
ACTTCCGCCA TCATCGGCAG GCGCATCAGC 
CTG CCACATT GCCGCCGTCG TCCGCGCCGG 
GACACCATAC CGAAACCGTC ATCCAAGACG 
GTCTCGCGCC GGCGCATCCT GAACGAACTG 
AATGGG CTTT TTCGGATAA 



This corresponds to the amino acid sequence <SEQ ID 196; ORF 047>: 

m047 .pep 

1 MVIIQARXXG XLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETQRN QPXXIMIXGG GNIGYRLAKQ 

101 LEHAYNVKII ECRPRRAEWI AENLDNTLVL QGSATDETLL DNEYIDEIDV 

151 FCALTNDDES NIMSALLAKN LGAKRVIGIV NRSSYVDLLE GNKID IWSP 

201 HLITIG5ILA HI RRGDIVAV HP I RRGTAEA I EWAHGDKK TSAIIGRRIS 

251 GIKWPEGCHI AAWRAGTGE TIMGHHTETV IQDGDHIIFF VSRRRI LNEL 

3 01 EKLIQVKMGF FG* 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 197>: 

a047 . seq 

1 ATGGTCATCA TACAGGCGCG GCGCGGCGGA CTGCTTGTCG GACGCAGCAT 
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51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAACATC GGCTACCGTC TCGCCAAGCA GCTCGAACAC 

301 GCATACAACG T C AAAAT CAT CGAATGCCGG CCGCGCCGTG CCGAATGGAT 

351 AGCCGAAAAC CTCGACAACA CCCTCGTCCT GCAAGGTTCG GCAACCGACG 

4 01 AAACCCTGCT CGACAACGAA TACATCGACG AAATCGACGT ATTCTGCGCC 

4 51 CTGACCAACG ACGACGAAAG CAACATTATG TCCGCCCTTT TGGCGAAAAA 

501 CCTCGGCGCG AAGCGCGTCA TCGGCATCGT CAACCGCTCA AGCTACGTCG 

551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 

601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 

651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCACACG 

701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 

751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

801 AACCATTATG GG AC AC CAT A CCGAAACCGT CATCCAAGAC GGCGACCACA 

851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAAAAACTC 

901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 



This corresponds to the amino acid sequence <SEQ ID 198; ORF 047. a>: 

a047 .pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI GYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK ID IWSPHLI 

201 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHIIFFVSR RRILNELEKL 

301 IQVKMGFFG* 

m047/a047 96.5% identity over a 3 12 aa overlap 



10 20 30 40 50 60 

m047 . pep MVIIQARXXGXLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVI IEGDEILFAA 

I I II I II I I I II I I I I II I I I M I I I I I I I I I I M I I M I I I M I I I I I I I M I M I 
a04 7 MVI I QARRGGLLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVI IEGDEILFAA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 7 .pep AAENIGAVI PELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKI IECRPRRAEWI 
I I M I I I I I I I I I I I I I : : II I I I I I I M I I If I I I I I I I I I I I I I I I I I I I I I 
a047 AAENIGAVIPELRPKETSTRR IMIAGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 

70 80 90 100 110 



130 140 150 160 170 180 

m047 .pep AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 

I [ M II I I I M II I 1 I 1 I 1 I I I M M I I I I I I 1 I I I 1 t I M 1 I I It t I 1 I i t M I I I I I I 
a04 7 AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 

120 130 140 150 160 170 



190 200 210 220 230 240 

m04 7 .pep NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 

I I I I I I I I I I I I I I I I I I I I I II I I I I M I II 1 II I I I I I I I I I I I I I I I I I I I I I I I I I 
a047 NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 

180 190 200 210 220 230 



250 260 270 280 290 300 

m0 47 .pep TSAIIGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 

I I M M M II I I 1 II I I I I I I I I I I M I I M I M I II I I I I I M II M I I II I M II ! I I 
a047 TSAIIGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 

240 250 260 270 280 290 



m047 . pep 



310 

EKLIQVKMGFFGX 
I I I I I I I II I I I I 
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a0 4 7 EKLIQVKMGFFGX 
300 310 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from 7V. gonorrhoeae 

ORF 047 shows 96.2% identity over a 312 aa overlap with a predicted ORF (ORF 047.ng) 
from N. gonorrhoeae: 



m047/g045 






m047 . 


.pep 


MVT TOARYYnYT.Vf^P^T AHT AOnT.PnrJADPOTPAVYPWNRTjTVPAPOTVT TFnnPTT.FA A 
n v _l x ynJvAAOAij v ox\^ xnL/xriyiJjjjruw/iuuy.ii v x iajlv in x\.xj x v c^.zr\^ j. v x x uvjuri x j*jc nri 

Illllll 1 MMMMMMMMMMM MMMMMM MMMMMMMI 

MVI IQARRGGLLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVI IEGDEILFAA 


o \J 


g047 




6 0 


m047 , 


.pep 


AAEN I G AV I P E LRP KE TQRNQ PXX I M I XGGGN I G Y RLAKQL EHAYNV KI I E CRP RRAEW I 

1 1 1 1 f f 1 1 1 1 1 1 J 1 1 f 1 = = IN Mill I f M 1 1 1 1 1 1 ! M 1 1 1 1 11 1 1 1 1 1 1 i 

AAENIGAVIPELRPKETSTRR IMI AGGGNI CYRIAKQLEHAYUVKI I ECRPRRAEWI 


120 


g047 




117 


m047 , 


.pep 


AENLDNTLVLQGS ATDETLLDNE Y I DE I DVFCALTNDDE SN I MS ALLAKKf LGAKRVI G I V 

MMIIMNIMMMIIIIIIIIIIIIIIIIMIIMIMIIIIIMIIIIMIIIII 

AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALIAKNLGAKRVIGIV 


180 


g047 




177 


m04 7. 


■pep 


NR S S YVDLLtEGNK I D I WS PHLITIGSI LAH I RRGD I VAVH P I RRG TAE A I E WAHGD KK 

II M 1 1 1 ! 1 1 M 1 II 1 M M 1 M 1 II 1 1 1 ! 1 II 1 1 1 1 i 1 1 M 1 1 M Ml 1 M M II 1 1 II 

NRSSYVDLLEGNKIDIWSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 


240 


g047 




237 


ra047. 


•Pep 


TSAI IGRRISGIKWPEGCHI AAWRAGTGETIMGHHTETVIQDGDHI I FFVSRRRILNEL 


300 


g04 7 




IIIIMIIMIMIIIIIIIMIIIIMMIIIIIMIIIIMIIIIMIIIIIMIIII 

TSAI IGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHI IFFVSRRRILNEL 


297 


m047. 


•pep 


EKLIQVKMGFFGX 313 
IMIIIIIIIIM 




g047 




EKLIQVKMGFFGX 310 





The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 199>: 

g048 . seq 

1 ATGCTCGACA 

51 TTACTACGTc 

101 CAGGTCCGAC 

151 AAACAAACCG 

201 cacctGCGAA 

251 GCGGCGCGGC 

3 01 GCGTTCCCCG 

3 51 TATGCCCGTA 

4 01 CCGCCCCGCG 
451 TGA 

This corresponds to the amino acid sequence <SEQ ID 200; ORF 048. ng>: 

g04 8 .pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 KQTGLLGMIG KSERGAATCE AIADNK AVYL MAVGGAAYLV AKAI KSSKVL 

101 AFPELGMEAV YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 20 1>: 

m04 8 . seq 

1 ATG CTCAACA AAGG CGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCGG 



AAGGCGAGGA GTTGCCCGTC 
ggcCCcgTCG ATCCGGTCGG 
CACAGCCACC CGCATGGACA 
GCCTCTTGGG CATGATCGGC 
GCcatCGCCG ACAACAAGGC 
ATACCTCGTG GCAAAAGCCA 
AATTGGGTAT GGAAGCCGTT 
ACCGTCGCCG TGGACAGCAA 
CAAATGGCAG GCGAAAATCG 



GATTTCACCA ACCGCCTGAT 
CGATGAAGTC GTCGGTCCCG 
AATTTACCCG CCAAATGCTC 
AAATCCGagc gcgGcgcggc 
CGTGTACCTC ATGGCAGTCG 
TCAAATCTTC CAAAGTCTTG 
TACGAATTTG AAGT CAAAGA 
AGGCGAATCC ATCCACGCCA 
GCATCATCCC CGTCGAGTCT 
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101 


CAGGTCCGAC 


CACAGCCACC 


151 


GAACAAACCG 


ACCTCTTGGG 


201 


CACCTGCGAA 


GCCATCGCCG 


251 


GCGGCGCGGC 


GTATCTCGTG 


301 


GCGTTCCCCG 


AATTGGGCAT 


351 


CATGCCCGTA 


ACCGTCGCCG 


401 


CCGCCCCGCG 


CAAATGGCAG 


451 


TGA 





CGCATGGACA AATTCACCCG CCAAATGCTC 
CATGATCGGC AAATCCGAGC GCGGCGTGGC 
ACAACAAAGC CGTGTACCTC ATGGCAGTCG 
GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 
GGAAGCCATT TACGAATTTG AAGTCAAAGA 
TAGATAGCAA AGGCGAATCC ATCCACGCCA 
GCGAAAATCG GCATCATCCC CGTCGAATCT 



This corresponds to the amino acid sequence <SEQ ID 202; ORF 048>: 

m04 8 . pep 

1 MLNKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 EQTDLLGMIG KS E RGVAT C E AIADNKA VYL MAVGGAAYLV AKAI KSSKVL 

101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 203>: 

a048 . seq 

1 ATGCTCGACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGACGAAATC GTCGGCCCAG 

101 CAGGTCCGAC CACCGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGCGGC 

201 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

2 51 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 

351 CATGCCCGTA ACCGTCGCCG TAGACAGCAA AGGCGAATCC ATCCACGCCA 

4 01 CCGCCCCGCC CCAATGGCAG GCGAAAATCG GCATCATCCC CGTCAAATCT 

4 51 TGA 

This corresponds to the amino acid sequence <SEQ ID 204; ORF 048.a>: 

a04 8 .pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEI VGPAGPTTAT RMDKFTRQML 

51 EQTDLLGMIG KSERGAATCE AIADNKA VYL MAVGGAAYLV AKAI KSSKVL 

101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPPQWQ AKIGIIPVKS 

151 * 



m048/a048 96.0% identity over a 150 aa overlap 

10 20 30 40 50 60 

MLNKGEELPVDFTNRLIYYVGPVDPVGDEWGPAGPTTATRMDKFTRQMLEQTDLLGMIG 
I I : I I M I I I M 1 E I I I I II I i f I I M II : II M II M I I I I II I I I I I I I I ! I I I I I I I 
MLDKGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 
10 20 30 40 50 60 



m04 8 -pep 
a048 



70 80 90 100 110 120 

m04 8 . pep KSERGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPV 

II I I I : I I I I I I I I I I I I I ! I I I I I I I II II I I I I I I II I I I II I I I I I I I I I I I I I || I 
a04 8 KSERGAATCE A I ADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPV 

70 80 90 100 110 120 



130 140 150 

m04 8 .pep TVAVDSKGES IHATAPRKWQAKIGI I PVESX 

I I I I I I I I I I I I I I I I : I I I I I I M I I : I I 
a04 8 TVAVDSKGES I HAT APPQWQAKIG 1 1 PVKSX 

130 140 150 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 048 shows 96.4% identity over a 150 aa overlap with a predicted ORF (ORF 048.ng) 
from N. gonorrhoeae: 
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m048/g048 

10 20 30 40 50 60 

m04 8 . pep MLNKGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

l^i M I ,i Mil! M MMMM li II I IMNII III M MM I I I I :i I MINI 

904 8 MLDKGEELPVDFTNRLIYYVGPVDPVGDEWGPAGPTTATRMDKFTRQMLKQTGLLGMIG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 8 . pep KSERGVATCEAI ADNKAVYLMAVGGAAYLVAKAI KSSKVLAFPELGMEAI YEFEVKDMPV 

1 1 i M : M 1 1 ! M II II !l 1 1 1 II 1 1 N 1 1 1 1 M 1 1 1 1 1 1 1 1 : 1 1 I ! I h I ! 1 1 , 1 1 1 !| 

g04 8 KSERGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAV YEFEVKDMPV 

70 80 90 100 110 120 

130 140 150 

m04 8 .pep TVAVDSKGES I HATAPRKWQAKIGI IPVESX 

Mi Ml II II III iMMIII Ml I Ml, I I 

g04 8 TVAVDSKGESIHATAPRKWQAKIGI IPVESX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 205>: 

g049 . seq 

1 ATGCGGGCGC AGGCGTTTGA TCAACCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTT CAG GATATTGATT 

101 TGGACGGGCA TCAACGCCTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCGG ATTCTGCCGC ATCGGCGTTT TCCCCGCCCT 

201 CAATCTGTGC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC AT CGAAC CGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAAccggca t TTGCAGGGA 

3 01 AGCCTgcgcg TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 
351 CGACTTCCTC GCCGCAATCG GCAACGGCg c t GTTGTGTT C TTCCTGCCAT 

4 01 TTCTTCAGAT ACGCCTTTAA 



This corresponds to the amino acid sequence <SEQ ID 206; ORF 049.ng>: 

g049 -pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRL FRTAFAVFRN 

51 PVCRRTGFCR IGVFPALNLC GFKFGTVFFG IEPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGAWF FLPFL QIRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 207>: 

m049.seq (partial) 

1 ATGCGGGCGC AGGCGTTTGA TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG GATATTGATT 

101 TGGACGGGCA TCAACGTTTC TTCCGCATCG TTTTCCCCGT TTTCCGAAAC 

151 CGCCGGCTCA TTCGTGCCGG ATTCTGCCTC GTCGGCGTTT TCCCCGCTTT 

201 CAATCTGTCC GGTTTCAAAT TCGACACTGT CTTTTTTGGT ATCAAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAACCGACA TTTGCAGGGA 

3 01 AGCCTGCGCG TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

3 51 CGACTTCCTC GCCGCAATCG GCAACGGCGG CATTGTGTTC CTCCTGCCAT 

4 01 TTTTTCAGAT ACGCCTT . . . 



This corresponds to the amino acid sequence <SEQ ID 208; ORF 049>: 

m049 .pep (partial) 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRF FRIVFPVFRN 

51 RRLIRAGFCL VGV FPAFNLS GFKFDTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFDFL AAIGNGGIVF LLPFFQIRL. 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 209>: 

a049.seq 
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1 ATGCGGGCGC AGGCGTTTGA TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG AATATTGATT 

101 TGGACGGGCA TCAACGCTTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCCG ATTCTGCCGC ATCGGCGTTT TCCCCGCCTT 

201 CAATCTGTCC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC ATCAAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAACCGACA TTTGCAGGGA 

301 AGCCTGCGCG TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCGG CATTGTGTTC CTCCTGCCAT 

4 01 TTTTTCAGAT ACGCCTT 

This corresponds to the amino acid sequence <SEQ ID 210; ORF 049.a>: 

a04 9 .pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ NIDLDGHQRF FRTAFAVFRN 

51 PVCRRTRFCR IGVFPAFNLS GFKFGTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGGIVF LLPFF QIRL 

m049/a049 90.6% identity over a 139 aa overlap 

10 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

I I I t i I f I I I II I I I I I I I I I I I I I I I M I : I I I I I I I I I I I : I MM I : M 
a04 9 MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQNIDLDGHQRFFRTAFAVFRNPVCRRTRFCR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 9.pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 
: I I I I M I I M I I I M I I M I I I I I I I M II I I II I I II II II M I I II I M II I I I I I 
a04 9 IGVFPAFNLSGFKFGTVFFGIKPDSPPRFDVFFRNRHLQGSLRVE PVFLK DDHRVGFDFL 

70 80 90 100 110 120 

130 139 
m04 9.pep AAIGNGGIVFLLPFFQIRL 

II I I II I M I II I I M M I 
a04 9 AAIGNGGIVFLLPFFQIRL 

130 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 049 shows 86.3% identity over a 139 aa overlap with a predicted ORF (ORF 049.ng) 
from N. gonorrhoeae: 

m049/g049 

10 20 30 40 50 60 

m049 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDI DUDGHQRFFRI VFPVFRNRRL I RAGFCL 

I I I II M : II I I ! I I I I I I I I I I II INI I I h I Mi I : I : | in, MM 

904 9 MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRLFRTAFAVFRNPVCRRTGFCR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 9 . pep VGVFPAFNLSGFKFDTVFFGI KPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 

= MIII = li UN I III II :| :; : II |!M I III MIMM :i MM I! II, Mill 

904 9 IGVFPALNLCGFKFGTVFFGIEPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 

70 80 90 100 110 120 

130 139 
m04 9 . pep AAIGNGGIVFLLPFFQIRL 

I I I I I I = = I I = i I I = I I I 1 
g04 9 AAIGNGAWFFLPFLQIRLX 

130 140 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 21 1>: 

g050 . seg 

1 atgggcgCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGg 

51 cacgcccGAA AAAGccgtgt TGATGGcaaA AGAATCCCTG ATGAGCCACA 

101 TCGAcatCca aGaATTGCAG GAAAAAGCCG CGTccggggc ggaattgtcc 

151 accaccgaAG CcCTGCGCCT cGAACTCTTT GAAAAGGTCA ACGCGCTGGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

2 51 TCCTCGATTA CCCGACCCAT GCCGCCTCCA AACCGATTGC CATGATTCCC 

3 01 AACTGTGCcg ccacCCGcca cgtcgAATTT GAATTGgACG GCTCAGGtcc 
351 TGTCGAactc acgccGCcgc gtgtCGAAGA CTGA 



This corresponds to the amino acid sequence <SEQ ID 212; ORF 050.ng>: 

g050 -pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKI LDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVED* 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 213>: 

zn050 . seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGTATCG GCATCGGCGG 

51 C.agCCgAA AAAGCCGTGC TGATGGCAAA AGAGTCCCTG ATGAGCCACA 

101 TCGACATTCA AGAATTGCAG GAAAAGGCCG CGTCCGGCGC GgAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTGGG 

201 CATCGGCGCA CAAGGCTTGG GCGGACT GAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA TCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

3 01 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA TGGCCCGATT TGA 



This corresponds to the amino acid sequence <SEQ ID 214; ORF 050>: 

m050 .pep 

1 MGAGWCPPGI LGIGIGGXAE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKI LDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDGPI * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 21 5>: 

a050 . seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGG 

51 TACGCCCGAA AAAGCCGTGT TGATGGCGAA AGAATCCCTG ATGAGCCACA 

101 TCGACATCCA AGAATTGCAG GAAAAAGCCG CGTCCGGCGC GGAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTAGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA CCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA CTGGCCC 

This corresponds to the amino acid sequence <SEQ ID 216; ORF 050.a>: 

a050 . pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDWP 



moso/aoso 97.7% identity over a 129 aa overlap 



10 20 30 40 50 60 

m050 . pep MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
I I M I I I I I I I I I I I I I : I I ! I I I I I I I I I 1 I I I II I I I I I I I I I I I I I I I I I I I I f I I 
a050 MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m050 . pep EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 
M I I I I I I I I I I I I II I I I I I M I ! II i I I I I I I I I I I I I f | | ) | | | | | | | | | | | M I I I 
a050 EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

70 80 90 100 110 120 

130 

mOSO.pep TPPRVEDGPIX 

II I II I I I 
a050 TPPRVEDWP 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from vY gonorrhoeae 

ORF 050 shows 98.4% identity over a 127 aa overlap with a predicted ORF (ORF 050.ng) 
from N. gonorrhoeae; 

m050/g050 

10 20 30 40 50 60 

m050 .pep MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

MMIMIMMIIMh IIIIIIMIIIMIIIMIIIIIMIIIIIIIIIIIIMII 

9050 MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

10 20 30 40 50 60 

70 80 90 100 110 120 

m050 .pep EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

IIMIMIIIMIIIIMIIIMIMIIIIIIIIIIIIIIIIMIIIIIIMMIIIMI 

905 0 EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

70 80 90 100 110 120 

130 

mO 5 0 . pep TPPRVEDGPIX 

lllllll 
gO 5 0 TPPRVEDX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 217>: 

g050-l . seq 

1 ATGACCGTTA TCAAGCAAGA AGACTTTATT CAAAGTATCT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCAAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCGGA CATGAGCGTG GAAAAGATGG TTAACGAAGG CGTACGCCGC 

301 GCCTACACTT GGGAAGGCAA CACCCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCACA 

4 01 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

4 51 GGCGGCTCTG AAAACAAATC CAAACTCGCT ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACGATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGCAcgcC CGAAAAAGCC 

601 GTGTTGATGG cgaAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAG GTCAACGCGC TGGGCATCGG CGCGCAAGGC 

7 51 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCATGCCGCC TCCAAACCGA TTGCCATGAT TCCCAACTGT GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GTCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGACC CGATCTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGATAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCGCACAAAC GCCTCGTCAA TATGCTCGAC AAAGGCGAGG AGTTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCC GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

12 01 AAATTTACCC GCCAAATGCT CAAACAAACC GGCCTCTTGG GCATGATCGG 
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1251 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAGG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CATACCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGTA TGGAAGCCGT 

1401 TTACGAATTT GAAGTCAAAG ATATGCCCGT AACCGTCGCC GTGGACAGCA 

14 51 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAGTC TTGA 

This corresponds to the amino acid sequence <SEQ ID 21 8; ORF 050-1 .ng>: 

g050-l.pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EKMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTI PTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVED*PDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVNMLD KGEELPVDFT NRLI YYVGPV DPVGDEVVGP AGPTTATRMD 

401 KFTRQMLKQT GLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAVYEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 

g050-l/p!4407 

sp| P14 4 07 | FUMB_ECOLI FUMARATE HYDRATASE CLASS I, ANAEROBIC (FUMARASE) 
>gi I 280063 I pir | | B44511 fumarate hydratase (EC 4.2.1.2) fumB, iron-dependent - Escherichia coli 
>gi I 146048 (M27058) anaerobic class I fumarase (EC 4.2.1.2) [Escherichia coli] Length = 548 

Score = 172 bits (432), Expect - 4e-42 

Identities = 138/488 (28%), Positives = 216/488 (43%), Gaps = 22/488 (4%) 

QSICDAFQFISYYHPKDYI DALYKAWQKEENPAAKDAMTQILVNSRMCAENNRPICQDTG 70 
Q+ DA + H K L+ E + K Q L NS + A+ P CQDTG 
QAFHDASFMLRPAHQKQVAAILHDPEASEND KYVALQFLRNSEIAAKGVLPTCQDTG 109 

I ATVFLKVGMDVQWDADMS VEKMVNEGVRRAYTWEGNTLRAS VLADPAGK RQNTKDNTPA 130 

A + KG V W E+ +++GV YEN + A K NT N PA 

TAI I VGKKGQRV-WTGGGD-EETLSKGVYNTYI -EDNLRYSQNAALDMYKEVNTGTNLPA 166 

VIHMS IVPGGKVEVTCAAKGGGSENKSKL AMLNPSDNIVDWVLKTI PTMGAGWCP 185 

I + V G + + C AKGGGS NK+ L A+L P + +++++ + T+G CP 



EL + + L EL E+ 



Query : 


11 


Sbjct : 


53 


Query : 


71 


Sbjct : 


110 


Query : 


131 


Sbjct : 


167 


Query : 


186 


Sbjct : 


226 


Query : 


246 


Sbjct: 


285 


uery : 


302 ] 


Sbjct: 


344 


Query: 


359 


Sbjct: 


404 


Query: 


419 


Sbjct : 


464 


Query: 


478 


Sbjct: 


524 



D++++ P H AS P+ M +C+A R+++ ++ + 



RVD VDKLTKE EVAS WKTGDVLLLNGKI LTGRDAAHKRLVNM 358 

+VD+++ KE +++ + L L G 1+ GRD AH +L + 



+D G+ELP + IYY TTA RMD + + G + M+ K 



+A + YL ++GG AA L ++IK + +A+PELGMEA+++ EV+D P 



+ VD KG 
TILVDDKG 531 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 219>: 

m050-l . seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AG CT AC TAT C ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCTGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCGGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG AACGTCCAAT 

251 GGGATGCGGA CATGAGCGTG GAAGAGATGG TTAACGAAGG CGTACGCCGC 
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301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCC ATGCTCAATC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGT ATCGGCATCG GCGGCACGCC CGAAAAAGCC 

601 GTGCTGATGG CAAAAGAGTC CCTGATGAGC CACATCGACA TTCAAGAATT 

651 GCAGGAAAAG GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TGGGCATCGG CGCACAAGGC 

751 TTGGGCGGAC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTATCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCAAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCG GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGTGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGATAGCA 

14 51 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 220; ORF 050- 1> 

m050-l .pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM NVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLN KGEELPVDFT NRLI YYVGPV DPVGDEWGP AGPTTATRMD 

4 01 KFTRQMLEQT DLLGMIGKSE RGVATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 

m050-l/g050-l 98.2% identity in 507 aa overlap 

10 20 30 40 50 60 

m050-l.pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
I I t I I I I I I I I I I I I I I I I I I I I I I I I I M 1 I I i I | | | | | | | | i | | | t I I I I I I I I I M i 

g050-l MTVIKQEDFI QSICDAFQFI SYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m050-l . pep NNRPICQDTG I AT VFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
I I I I M I M I I I I I I I I I I I : I I I I II I I I I : I I I I I I M I I I I I I ! I I I I I I I I I I I I I 
g050-l NNRPICQDTGIATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

mO 5 0 - 1 . pep RQNTKDNTPAVIHMSI VPGGKVEVTCAAKGGGSENKSKLAMLNPSDNI VDWVLKTI PTMG 
M I I I I I I I I I I I I ! I II I I I I I I I I I I I I I I I i I I I I I I I I I I | I | | | | | | | | | | | | | I 
g050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m050-l . pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
I M I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I | I I I | | | | | | | | | | | | I I I I I I I I I I I I 
g050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m050-l .pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
N ! I I I I I I I I I I I I 1 f I I I I I I I I | | I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I 
g050-l VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 

250 260 270 280 290 300 



WO 99/57280 



PCT/US99/09346 



256 



310 320 330 340 350 360 

m050-l .pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN 
I 1 i t ) ) I I 1 II t 1 1 I I I M 1 1 1 I U I I 1 I I i II I I I I I ! I I I I I I I ) I 1 t : I i : 
g050-l PRVEDXPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVNMLD 

310 320 330 340 350 360 

370 380 390 400 410 420 

m050-l . pep KGEELPVDFTNRLIYYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 
I I I ! I I f I I I I I I I I I I I (I I I I I I I I I I I I I f I I I I I I I I I I I I I I : I I I I I I I I I I I 
g050-l KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLKQTGLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

m050-l.pep RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA 
I I : I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I !: I I I I i I I I I I i I I 
g050-l RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPVTVA 

430 440 450 460 470 480 

490 500 
m050-l .pep VDSKGES IHATAPRKWQAKIGI I PVESX 
I I I I I I I f I I t I 1 I I I I I I I I I I I I I i I 
g050-l VDSKGES IHATAPRKWQAKIGI I PVESX 

490 500 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 22 1>: 

a050-l . seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AACCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGCAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGATACCGGT ATCGCGACCG TGTTTTTGAA AGTCGGTATG GATGTGCAAT 

2 51 GGGATGCAGA CATGAGCGTC GAAGAGATGG TTAACGAAGG CGTGCGCCGC 

301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTTC TCGCCGACCC 

351 CGCCGGCAAA CGCCAAAATA CCAAAGACAA CACGCCCGCC GTCATCCATA 

4 01 TGAGCATCGT GCCGGGCGAC AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

4 51 GGCGGTTCTG AAAACAAATC CAAACTCGCC ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATTCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGTACGCC CGAAAAAGCC 

601 GTGTTGATGG CGAAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TAGGCATCGG CGCGCAAGGC 

7 51 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCGAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGACGAAAT CGTCGGCCCA GCAGGTCCGA CCACCGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

1251 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

14 01 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGACAGCA 

14 51 AAGGCGAATC CATCCACGCC ACCGCCCCGC CCCAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCAAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 222; ORF 050-l.a>: 

a050-l .pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGD KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD V DKLTKE E V A SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLD KGEELPVDFT NRLI YYVGPV DPVGDEIVGP AGPTTATRMD 

4 01 KFTRQMLEQT DLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPPQWQAKI 

501 GIIPVKS* 
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a050-l/m050-l 98.4% identity in 507 aa overlap 

10 20 30 40 50 60 

a050-l ,pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
t I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I II ! I I I I I I I I I ! t I I II I I I I I I I I I 
m050-l MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

aO 50- 1 . pep NNRPICQDTGI ATVFLKVGMDVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
I I I I I I I I I I I I I I I ) I I I I : I t I I II I i I I 1 I I I I I I 1 I I I 1 1 I t I I 1 1 ) I I I ) I I I II 
m050-l NNRPICQDTGI ATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

a 05 0-1 .pep RQNTKDNTPAVIHMSIVPGDKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 
I I I I I I I I I I I I I I I I I M I I I I I I I i I I I J I I I I I I I I I I I I I I I I I I I I I II I i i I I 
m050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

a050-l .pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
m050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

aO 5 0-1. pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMI PNCAATRHVEFELDGSGPVELTP 
I I I I I I I I 1 I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I i I I I I I I I I I I I I I I I I 
m050-l VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPI AMI PNCAATRHVEFELDGSGPVELTP 

250 260 270 280 290 300 

310 320 330 340 350 360 

a050-l.pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLD 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I : 
m050-l PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN 

310 320 330 340 350 360 

370 380 390 400 410 420 

aOSO-l.pep KGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 
I I I I I I I I I I I I I I M I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
m050-l KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

a050-l . pep RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA 
I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I E I I I I I I i I I I I I I I I 
mO50-l RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPVTVA 

430 440 450 460 470 480 

490 500 
a050-l .pep VDSKGESIHATAPPQWQAKIGIIPVKSX 
I I I I I I I I I I I I I : I I I I I I I I I I : I I 
mO 5 0 - 1 VDSKGES IHATAPRKWQAKI G 1 1 PVESX 

490 500 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 223>: 

g052 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

3 01 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

3 51 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 224; ORF 052.ng>: 

g052.pep 
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1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 225>: 

m052 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAAC AG C AGTTTGGTGT TGG CTTTGAC 

2 01 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 
251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 
301 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

3 51 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 226; ORF 052>: 

m052 .pep 

1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 227>: 

a052 . seq 

1 ATGGCTTTGG TCGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGAGAGCCG ACAGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCTCCC 

151 AAGGGATTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCGTTTA TATCAGTCGG CGACACGTGA CTCACTTCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAATCAC ATGGTCGCCC GCCTGCAAAA AGGTGAAAAA 

351 CGCCGCCTGA 



This corresponds to the amino acid sequence <SEQ ID 228; ORF 052.a>: 

a052 .pep 

1 MALVAEETEI SAPCFKG*EP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DT* LTSMPNLVTM LLIKPTWPN 

101 RLRLEITWSP ACKKVKNAA* 



m052/a052 95.8% identity over a 1 19 aa overlap 

10 20 30 40 50 60 

m052 .pep MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 
I I I I II I I II I I I I I i I I I II I I I I 1 I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I 
a052 MALVAEETEISAPCFKGXEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m052 . pep S LVLALT AAFH S FI SVGDTRLT PMPN LVTMLLI KPT VVPNRLRLETTW S P ACRKVKN AAX 

I I II I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I 111111:1111111 
a052 SLVLALTAAFHSFI SVG DTXLTSMPN LVTMLLI KPT VVPNRLRLE I TVtfSPACKKVKNAAX 

70 80 90 100 110 120 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 052 shows 95.8% identity over a 1 19 aa overlap with a predicted ORF (ORF 052.ng) 
from N. gonorrhoeae: 

m052/g052 

10 20 30 40 50 60 

m052 . pep MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKASKSATSP KGLDGVSKNS 

I II i IMIMM M III I M I II II I II (Mil I I! Mil II Ml II III lil Ml II I I 
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g052 MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m052 . pep S LVLALTAAFHS FI SVGDTWLTSMPNLATMLLI KPTWPNRLRLE I TWS PACKKVKNAAX 

1 1 1 1 M I M 1 1 1 M i I M I II lllhlMIMIMIIMMII lllllhllIMM 

gO 5 2 SLVLALTAAFHS F I S VGDTRLTPMPNLVTMLLI KPTWPNRLRLETTWSPACRKVKNAAX 

70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 229>: 

g073 . seq 

1 ATGTGTATGC CATACGCAAT AAGGGTTTCA GACGGCATCT GCCGCATTTT 

51 TCCGCCGATG CCGTCTGAAA CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AAC CGACCGA CAGGCGCACC 

151 AGTCCGGGGC GGatacCGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT TGGTGATGGT CGAGCGCACG TCGCCGAGGT 

251 TGGCGGTACG GGAAAAGAGT TCCACGACTT TCCACGCGGC TGCTTGGTCG 

301 GCGACTTCAA AACCGATGAC GATGCCGCCG CCGTTTTGCT GTTTG CGGAT 

351 AAGCTCCGCC TGCGGATGGT CGGGCAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 230; ORF 073.ng>: 

g073 .pep 

1 MCMPYAIRVS DGICRIFPPM PSETRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 SPGRIPAASF SSGCILPCW VHGLVMV ERT SPRLAVREKS STTFHAAAWS. 

101 ATSKPMTMPP PFCCLRISSA CGWSGNPV* 

The following partial DNA sequence was identified inW meningitidis <SEQ ID 23 1>: 

m073 . seq 

1 ATGTGTATGC CATATAAGAT AAGGGTTTCA GACGGCATCT GCTGTCCAAT 

51 GCCGTCTGAA ACACGCAATC AGCGTGCGAG TGCCTGTTTC AAATCGTCAA 

101 TCAAATCGCC AACATATTCC AAACCGACCG ACAGGCGCAC CAATC CGGGG 

151 CGGATGTTGG CGG CGAGTTT TTCTTCGGGC TGCATCCTGC CGTGCGTGGT 

201 TGTCCACGGG TGGGTAATGG TCGAG CGCAC GTCACCGAGG TTGGCGGTGC 

251 GGGAAAAGAG TTCCACGCCG TCCACAACTT TCCACGCCGC TTCTTGATCG 

301 GCAACTTCAA AGCCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 

351 AAGCGCCGCC TGAGGATGGT CGGACAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 232; ORF 073>: 

m073 .pep 

1 MCMPYKIRVS DGICCPMPSE TRNQRASACF KSSIKSPTYS KPTDRRTOPG 

51 RMLAASF SSG CILPCVWHG WVMV ERTSPR LAVREKSSTP STTFHAASXS 

101 ATSKPMTMPP PFCCLRISAA XGWSDNPV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 233>: 

a073 . seq 

1 ACGTGTATGT CATATAAGAT AAGGGTTTCA GACGGCATTT GCGGTGTTTT 

bl TCCGCCGATG CCGTCTGAA. CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC 

151 AATCCGGGGC GGATGTTGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT GGGTAATGGT CGAGCGCACG TCGCCGAGGT 

251 TGGCGGTACG GGAGAAAAGT TCGACGCCGT CCACGACTTT CCACGCGGCT 

301 GCTTGGTCGG CGACTTCAAA GCCGATGACG ATGCCGCCGC CGTTTTGCTG 

351 TTTGCGGATA AGCTCCGCCT GAGGATGGTC GGGTAATCCG GTGTAA 

This corresponds to the amino acid sequence <SEQ ID 234; ORF 073. a>: 

a073.pep 

1 TCMSYKIRVS DGICGVFPPM PSEXRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 NPGRMLAASF SSGCILPCW VHGWVMV ERT SPRLAVREKS STPSTTFHAA 

101 AWSATSKPMT MPPPFCCLRI SSA*GWSGNP V* 



m073/a073 92.3% identity over a 130 aa overlap 
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10 20 30 40 50 

m073 - pep MCMPYKIRVSDGICC PMPSETRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF 

M M I I I I I I M I I I I I : I I I I I I I I I I | | | | | | | | | M I I I M I I I I I I I I I I 

a07 3 TCMSYKIRVSDGICGVFPPMPSEXRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF 

10 20 30 40 50 60 

60 70 80 90 100 110 

m073 .pep SSGCILPCVVVHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI 
I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I : M II II I I I M I I I I I II 
a07 3 SSGCILPCVVVHGWVMVERTSPRLAVREKSSTPSTTFHAAAWSATSKPMTMPPPFCCLRI 

70 80 90 100 110 120 



120 129 
mO 7 3 . pep S AAXGWS DN PVX 

I : II I I I (III 
a 0 7 3 S SAXGWSGN PVX 

130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 073 shows 87.0% identity over a 131 aa overlap with a predicted ORF (ORF 073 .ng) 
from N. gonorrhoeae: 

m073/g073 



10 20 30 40 50 

mO 7 3 . pep MCMPYKI RVSDG ICC PMPS ETRNQRAS ACFKS S I KS PTYS KPTDRRTNPGRMLAAS F 

INN M M M 1 1 1 1 M I M 1 1 I 1 1 I M 1 1 M 1 1 1 M 1 1 1 1 1 M h M h MM 

9073 MCMPYAI RVSDG I CRI FP PMPS ETRNQRAS ACFKS S I KS PTYS KPTDRRTS PGR I PAAS F 

10 20 30 40 50 60 



60 70 80 90 100 110 

m073 . pep SSGCILPCVVVHGWVIWERTSPRliAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI 

MMIMIMMI MMMMMMIMMI MUM MMMMMMMMM 

g073 SSGCILPCVWHGLVMVERTSPRLAVREKSST TFHAAAWSATSKPMTMPPPFCCLRI 

70 80 90 100 110 



120 129 
mO 7 3 . pep S AAXGWSDNPVX 

hi Ml MM 

g 0 7 3 SSACGWSGNPVX 
120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 235>: 





g075 .seq 














1 


ATGCCGCCTT 


ACTTCATCAC 


CCTCTTAACG 


ATGGAAAATA 


CAAAAAGCGC 




51 


GGCGAAAACG 


CCCACTACAA 


TCCAACCGGC 


TTCCATACCG 


TCCGCTTTTG 




101 


CGGCTTCCAA 


AGCGTTTTTT 


GCCGTTTCGG 


GCAACGCTGC 


GTTTGCCTGT 




151 


GCCGCCAAAG 


CCAGCGGGGC 


GGCTGTTACA 


ACAGCCAGTT 


TTGCGCCGTA 




201 


TTTACGGCAG 


GTGTTAATAA 


ATTTCATGAT 


ATTTTCCTTT 


ACGAAATTTT 




251 


TAAAAAAATG 


TGTTTGCGGG 


CTTTGTGAAG 


GTTTTAGAGA 


CCGCCTGCCG 




301 


GGCCTCTTAA 


ACTTAATCTT 


CTTTTTCGTA 


GAATCCGAAA 


ATTACAAATT 




351 


CCCCGCCTAT 


CTCTTCCAAT 


GCCGAGCTAA 


AAGCGTCTTC 


ATAG CTGTC A 




401 


TATTTACCGG 


CTGA 









This corresponds to the amino acid sequence <SEQ ID 236; ORF 075.ng>: 

g075 .pep 

1 MPPYFITLLT MENTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNAAFAC 

51 AAKASGAAVT TASFAPYLRQ VLINFMIFSF TKFLKKCVCG LCEGFRDRLP 

101 GLLNLIFFFV ESENYKFPAY LFQCRAKSVF IAVIFTG* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 237>: 



m075. seq 












1 


ATGCCGTCTT 


ACTTCATCAC 


TCTCTTAACG 


ATGGAAAATA 


CAAAAAGCGC 


51 


GGCGAAAATG 


CCCACTACAA 


TCCAACCGGC 


TTCCATACCG 


TCCGCTTTTG 


101 


CGGCTTCCAA 


AGCGTTTTTT 


GCCGTATCGG 


GCAACGTTGC 


ATTTGCATGT 


151 


GCGGCCAAAG 


CCAGGGGAGC 


AGCTGTTACA 


ACAGCCAGTT 


TTGCGCCGTA 


201 


TTTACGGCAG 


GTGTTAATAA 


ATTTCATGAT 


ATTTTCCTTC 


AAAAAGTGTT 


251 


TGGCGGTAAT 


GGATGGAGCG 


TTTTTCAGAC 


GACCGCCGAA 


CATCCGAAAA 


301 


TCAGTCTTTC 


AAAAAT CCG A 


ATACGACAAA 


TTCGTATTGG 


TTGCCGATTT 


351 


CTTCCAAACC 


TGCGTTAATC 


GCTTCTTCGA 


AGTCG TAGAA 


ATAATCGGCA 


401 


TTGGTGATTA 


A 









This corresponds to the amino acid sequence <SEQ ID 238; ORF 075>: 

xn075 .pep 

1 MPSYFITLLT MENTKSAAKM PTTIQPASIP SAFAASKAFF AVS GNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVMDGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 075 shows 65.7% identity over a 137 aa overlap with a predicted ORF (ORF 075.ng) 
from N. gonorrhoeae: 

m075/g075 

10 20 30 40 50 60 

MPS YFI TLLTMENTKS AAKMPTT I QPAS I PS AFAASKAFFAVSGNVAFACAAKARGAAVT 

II IMIMMMMIMI IMMMIIIIIIIMIIIMIIIhlllMIM Mill 

MPPYFITLLTMENTKSAAKTPTTIQPASIPSAFAASKAFFAVSGNAAFACAAKASGAAVT 
10 20 30 40 50 60 

70 80 90 100 110 

TASFAPYLRQVLINFMIFSF KKCIiAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVAD 

IIIMIIIMI1MIIIIII IN- ::||!::::| | = = : : | 

TASFAPYLRQVLINFMIFSFTKFLKKCVCGLCEGFRDRLPGLLNLIFFFVESENYKFPAY 
70 80 90 100 110 120 

120 130 
FFQTCVNRFFEWEI IGIGDX 
:|| :: | :| : | 
LFQCRAKSVFI AVI FTGX 
130 



m075 -pep 
g075 

m075 .pep 
g075 

ra075 .pep 
g075 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23 9>: 

a075 . seq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAAGA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCTGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AG T CG T AG AA ATAATCGGCA 

401 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 240; ORF 075. a>: 

a075 .pep 

1 MPSYFITLLT MEKTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVM DGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 
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m075/a075 98.5% identity over a 136 aa overlap 

10 20 30 40 50 60 

m075 . pep MPSYFITLLTMENTKSAAKMPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 
I 11 I It t I t j M : M I I I I I I I I I I I I I 1 I I I I I I I I I I I II I! 1 I I I I I I I I I I I M I 
a075 MPSYFITLLTMEKTKSAAKTPTTIQPAS I PSAFAASKAFFAVSGNVAFACAAKARGAAVT 

10 20 30 40 50 60 

70 80 90 100 110 120 

m075.pep TASFAPYLRQVLINFMIFSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT 
I I I I I I I I I I I I I I I I M I I M I M I I I I I I I I I I I II I I I I I I I I M I I I I II II I I II 
a075 TASFAPYLRQVLINFMIFSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT 

70 80 90 100 110 120 

130 

n\07 5 . pep CVNRFFEWEI IGIGDX 

M I I I I I I I II I I M I I 
a075 CVNRFFEWEI IGIGDX 

130 

The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 24 1>: 

g080 . seq 

1 ATGT GGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA C C TGGTTT AT 

151 TCCGATAAGA AGGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCG CAC A GGAAGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA TACGGTTGAG 

3 01 GTCGTCCTGA CCGAGCGCAA GCCGGTTGCA CGTTGGGGCG ACCATGCCTT 

3 51 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 
451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 
501 GATGACCTAT ACGGCACGTT CGGCGTGGAA TGT CGTTTTG GACAACGGCA 
551 TCACCGTCAG GCTCGGifCGG GAAAAcg a GA TGAAACGCCT CCgGCTTTTT 
601 ACcgAAGCGT GGCAGCATCT gttgcGTAAG AATAAAAATC GGTTATCCTA 
651 TGTGGATATG Aggtataagg acggatttTC agtcccccat gctCCCGACG 
701 GTTTACCCGA AAAAGAATcc g AAGAATa 1 1 gggaacaggt ttgggacata 
751 ttacggcctg gcgtcggaaa cggttcgacg caaatttcaa tcagttatAA 
801 GGGCAGacga acaatggaac AGcagtaa 

This corresponds to the amino acid sequence <SEQ ID 242; ORF 080.ng>: 

g0 80 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVS LKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWNWL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLS YVDM RYKDGFSVPH APDGLPEKES EEYWEQVWDI 

251 LRPGVGNGST QISISYKGRR TMEQQ* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 243>: 

m080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGACATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

2 01 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

401 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 

551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 
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601 ACCGAAGCGT GGCAGCATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 
651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTTCCGACG 
701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 2441 ; ORF 080>; 

m080 . pep 

1 MWDNAEAMER LTRWLLVMMA MLLAASGLW FYNSNHLPVK QVSLKGNLVY 

51 SDKKTLGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARS AW I WL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY ASDGLPEKES EE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 080 shows 97.9% identity over a 242 aa overlap with a predicted ORF (ORF 080.ng) 
from N. gonorrhoeae: 

m080/g080 

10 20 30 40 50 60 

m0 8 0 . pep ^DNAEAMERLTRWLLVMMAM^ 

I IMIMI Mill MMIIMM IM MM Ml Mi M MMM II I I I IMM iM II M 

080 MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYT^SNHLPVKQVSLKGNLVYSDICKALGSLA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m0 8 0 . pep KEYIHGNILRTDINGAQEAYRRYPWI ASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 

MM ll II MM M MM IMIMI M I I I IMM I I M I I M M M I M I MM II I I M 

080 KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m0 8 0 . pep EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLiAKQGLGIKEMTYTARSAWI WL 

Ml Ml Ml I Ml I II I Ml M M M I M M II! I M MM M M M M MM M M III 

080 EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGI KEMTYTARSAWNWL 

130 140 150 160 170 180 

190 200 210 220 230 240 

mO 8 0 . pep DNG I TVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRY ASDGLPEKES 

Ml II I II M M I I! M MM I M I I I Ml MM I M II M I M I I I : | MMIIII 

080 DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVPHAPDGLPEKES 

190 200 210 220 230 240 



m0 80.pep EEX 
I I 

080 EEYWEQVWDI LRPGVGNGSTQI S I SYKGRRTMEQQX 

250 260 270 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 245>: 

a080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTAGTTTAT 

151 TCCGATAAGA AAGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGTTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 
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551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 

601 ACCGAAGCGT GGCAACATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 

651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTCCCGACG 

701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 246; ORF 080.a>: 

a080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 VVLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWIVVL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY APDGLPEKES EE* 

m08o/a080 99.2% identity over a 242 aa overlap 

10 20 30 40 50 60 

m080 . pep MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKTLGSLA 
I I M I M I M I I I I I I I 1 I I I M I I ! I ! | | | | | M M I M I I I I I I I I I I I I I I : I I j | | 
a080 MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKALGSLA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m080.pep KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEVVLTERKPVARWGDHALVDG 
I I i I M II II II I I! M I I I I I I I I I I f I I I I | | | M | | | | | | | | M | ( M | | | | | M I I 
a080 KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 

70 80 90 100 110 .120 

130 140 150 160 170 180 

mO 8 0 . pep EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWI VVL 

M M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I | | M I I 
a080 EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIVVL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m080 . pep DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYASDGLPEKES 
M I I II II I I I I I I I I I I I M I M I I I I M I I I II I II I I I I I I I II I I I I MINIM 
a080 DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYAPDGLPEKES 

190 200 210 220 230 240 



m080.pep EEX 
I I I 

a080 EEX 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 247>: 

CAAGCCCTCA AG CTTCCGAT 
CGTAACCGAC AGCCGCGATA 
CGGGCGGGCG GTTTGACGCG 
GGCGCGGCGG CGGTTGTGGT 
CGCGTTGAAA GTCGATGACA 
CGTGGCGCGA TAATGTGAAC 
GGCAAGACGA CGGTGAAGGA 
CGGCGATGAT GCCGTTTCGG 
gaTTGCCGCT GACTTTATTG 
ATTGAAATGG GCATGAACCA 
CGCCAAACCC GATGCCGCTT 
GATGCGGTTt cgacggagtg 



g081 . seq 








1 


ATGAAACCAC 


TGG AC CTAAA 


TTTCATCTGC 


51 


GCCGTCTGAA 


AACAAACCCG 


TGTCGCGCAT 


101 


TTCGGGAAGG 


CGATGTGTTT 


TTCGCATTGG 


151 


CATGATTTTG 


TTGGAGGCGT 


ATTGTCTGCG 


201 


TTCGCGCGAA 


GATTGCGCGG 


CTTTGGGCGG 


251 


CGCTTGCCGC 


GTTGCAAACG 


TTGGCGAAGG 


301 


CCGTTTGTGT 


TCGGCATTAC 


CGGTTCGGGC 


351 


GATGCTGGCT 


GCGGTATTGC 


GCCGCCGTTT 


401 


CGACGGCAGG 


CAACTTCAAC 


AACCACAt eg 


451 


AAATtaaAcg 


aAAAACACCG 


CTATGCCGTG 


501 


TTTTGGcgaa 


ctggcggtTt 


taacgcaaaT 


551 


TGGtcaACAA 


CGCCCTGCGC 


GCCCATGTCG 


601 


GGCGATATTG 


CCAAAGcgaa 


a a g c GAGATT 


651 


CGGCATGGCA 


CTGATTCCTC 


AAGAAGATGC 


701 


CGGCAACGTT 


TAATTTGAAT 


ACGTGCACTT 


751 


GTCCGCGCGG 


AAAATATCGT 


GCTGAAACCT 
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801 GGTGTGCGGC GACGAGCGCA CTGCCGTGGT GCTGCCTGTT CCCGGCCGCC 

8 51 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCCGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG CAAGGCTTCA GCAACATCAA 

951 AGGCCGTCTG AACGTCAAAG CCGGCATCAA GGGCGCAACC CTGATTGACG 

10 01 AT ACT TAT AA TGCGAATCCC GACAGTATGA AAGCCGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG CATTTTCGTG ATGGG CGAT A TGGGCGAACT 

1101 GGGCGAGGAc gaAGCCGCCG CCATGCACGC CGAagtcgGC GCGTACGCCC 

1151 GCGACCAAGG CAT CGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA 

1201 GCGGcggaAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

12 51 GTTGATTCAA GTGTTGAGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

13 01 TGAAAGGTTC GCGCTTTATG CAGAtggAAG AAGTGGTCGA GG C ATTGGAG 
13 51 GATAAGTga 

This corresponds to the amino acid sequence <SEQ ID 248; ORF 081.ng>: 

g081.pep 

1 MKPLDLNFIC QALKLPMPSE NKPVSRIVTD SRDIREGDVF FALAGGRFDA 

51 HDFVGGVLSA GAAAVVV S RE DCAALGGALK VDDTLAALQT LAKAWRDNVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVSATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNALR AHVGCGFDGV 

201 GDIAKAKSEI YAGLCSDGMA LIPQEDANMA VFKTATFNLN TCTFGVDSGD 

251 VRAENIVLKP LSCEFDLVCG DERTAWLPV PGRHNVHNAA AAAALALAAG 

3 01 LSLNDVAEGL QGFSNIKGRL NVKAGI KGAT L I DDTYNANP DSMKAAVDVL 
351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG A Y ARDQG I E A AYFVGDNSVE 

4 01 AAEKFGADGL WFAAKDPLIQ VLSHDLPERA TVLVKGSRFM QMEEWEALE 
451 DK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 249>: 

m0 81 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGAGCG GTTTGACGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCT GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGTGCTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

251 CGCTTGCCGC ATTGCAAACG CTGGCAAAGG CGTGGCGTGA AAATGT GAAT 

3 01 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 

3 51 AATGCTGGCT GCGGTATTGC GCCgCCGTTT CGGCGATGAT GCCGTGTTGG 

4 01 CGACGGCAGG CAACTTCAAC AAC CAT ATCG GATTGCCGCT GACTTTGTTG 
451 AAGTTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA 
501 TTTCGGCGAA CTGGCGGTTT TAACGCAmAT CGCCAAACCA AATGCCGCAT 
551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 
601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TACCAAGGTT TATGTTCAGA 
651 CGGCATTGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 
701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 
751 GTTCACGCGG AAAATATTGT GCTGAAACCG TTGTCGTGCG AATT TGATTT 
801 GGTGTGCGGC GATGAGCGCG CCGCCGTGGT GCTGCCTGTT CCCGGCCGCC 
851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCGGGT 
901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 
951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGAT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGG CGAT A TGGGCGAACT 

1101 GGGCGAACTG GGCGAGGACG AAGCCGCCGC TATGCACGCC GAAGT CGGCG 

1151 CGTATGCCCG CGACCAAGGC AT CGAAG CGG CTTATTTTGT CGGCGACAAC 

1201 AGCGTCGAAG CGGCGGAAAA ATTTGGCGCG GACGGTTTGT GGTTCGCCGC 

1251 CAAAGACCCG TTGATTCAAG TGTTGCGCCA CGATTTGCCC GAACGCGC C A 

13 01 CCGTGTTGGT GAAAGGTTCG CGCTTTATGC AGATGGAAGA AGTGGTCGAG 

13 51 GCATTGGAGG ATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 250; ORF 08 1>: 

m081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGERFDA 

51 H DFVEDVLAA GAAAVWS RE DCAAMDGALK VDDTLAALQT LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTXIAKP NAALVNNAMR AHVGCGFDGV 
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201 GDIAKAKSEI YQGLCSDGIA LIPQEDANMA VFKTATLNLN TRTFGIDSGD 

2 51 VHAENIVLKP LiSCE FDLVCG DERAAWLPV PGRHNVHNAA AAAALALAAG 

3 01 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAI DVL 
3 51 ARMPAPRIFV MGDMGELGEL GEDEAAAMHA EVGAYARDQG IEAAYFVGDN 
401 SVEAAEKFGA DGLWFAAKDP LIQVLRHDLP ERATVLVKGS RFMQMEEWE 
451 ALEDK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 081 shows 94.1% identity over a 455 aa overlap with a predicted ORF (ORF 081.ng) 
from K gonorrhoeae: 

m081/g081 



10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKIiPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 

I i 1 1 M 1 1 I M I IMI I MhMIMI I 1 1 Ml II MINIMI M M I II I MM 

g081 MKP LDLN F I CQ ALKL PM PS ENKP VS R I VTD S RD IRE GD V F FALAGGR FDAHDFVGGVLS A 

10 20 30 40 50 60 



70 80 90 100 110 120 

mOBl . pep GAAAVVVSREDCAAMDGALKVDDTLJUUjQTLAKAW 

MIMMMMMh I M M M I M II M I M M I M II M M II M M 1 1 M M M M 

gO 8 1 GAAAVVVSREDCAALGGALKVDDTLAAIjQTIJVKAWRDNTO 

70 80 90 100 110 120 



130 140 150 160 170 180 

m081 . pep AVLRRRFGDDAVIiATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP 

MMMIMIM I I ! 1 1 1 1 1 1 1 1 1 1 1 M 1 I E 1 1 1 1 1 1 1 1 M i M 1 1 M I M 1 1 1 MM 

g081 AVLRRRFGDDAVSATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTQIAKP 

130 140 150 160 170 180 



190 200 210 220 230 240 

m081 . pep NAALVNNAMRAHVGCGFDGVGDIAKAKSEI YQGLCSDGIALIPQEDANMAVFKTATLNIiN 

MIIMMMMIIIIMMIIIIIIIMM M 1 1 1 I M 1 1 II II 1 1 1 1 1 1 II 1 1 M II 

g 0 8 1 DAALVNNALRAHVGCGFDGVGD I AKAKS E I YAGLCSDGMALI PQEDANMAVFKTATFNLN 

190 200 210 220 230 240 



250 260 270 280 290 300 

mO 8 1 . pep TRTFGIDSGDVHAENIVLKPLSCEFDIjVCGDERAAVVLPVPGRHNVHNAAAAAALALAAG 

I II ! M M I i M M II M II II 1 1 1 1 II M M M 1 1 I M I 1 1 1 II M M II 1 1 1 II I M 

g081 TCTFGVDSGDVRAENIVLKPLSCEFDLVCGDERTAWLPVPGRHNVHNAAAAAALALAAG 

250 260 270 280 290 300 



310 320 330 340 350 360 

m081 . pep LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVIiARMPAPRIFV 

III I II Ml h I II II I Mi M hi II MMIMMIMI II 1 1 1 h 1 1 1 1 1 1 1 M I M I 

gO 8 1 LSLNDVAEGLQGFSNIKGRIjNVKAGIKGATLiIDDTYNANPDSMKAAVDVIiARMPAPRI fv 

310 320 330 340 350 360 



370 380 390 400 410 420 

m081 . pep mgdmgelgelgedeaaamhaevgayardqgieaayfvgdnsveaaekfgadglwfaakdp 
MIMMM II II M 1 1 1 II M H I M I M I M 1 1 1 1 M I M II I II M I M 1 1 M I 

g 0 8 1 MGDMGELGE DE AAAMHAE VGAYARDQG I E AA YFVGDNS VE AAE KFGADGLWFAAKDP 

370 380 390 400 410 



430 440 450 

mO 8 1 . pep LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

Mill IN M MM M Ml M M I M I M IMMI 

g081 LIQVLSHDLPERATVLVKGSRFMQMEEWEALEDKX 
420 430 440 450 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 251 >: 

a081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGATGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCG GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGTTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

2 51 CGCTTACCGC GTTGCAAATG TTGGCGAAGG CGTGGCGCGA GAATGTGAAC ' 

301 CCGTTTGTGT TCGGTATTAC CGGCTCGGGC GGCAAGACGA CGGTGAAGGA 

351 AATGTTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATAAT GCCGTTTTGG 

4 01 CGACGGCAGG CAACTTCAAC AACCACATCG GATTGCCGTT GACTTTGTTG 

4 51 AAATTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GTATGAACCA 

501 TTTTGGCGAA CTGGCGGTTT TGACACAAAT CGCCAAACCC GATGCCGCAT 

551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TATCAAGGCT TATGTTCAGA 

651 CGGCATGGCG CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 

7 51 GTCCACGCGG AAAATATCGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 

801 GGTGTGCGGC AACGAGTGCG CAGCCGTGGT TCTGCCCGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCCGCCGCCG CGCTGTCTTT GGCTGCAGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 

951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGTGAGGAC GAAGCCGCCG CCATGCACGC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG 

1201 GCGGCGGAAA AATTTGGCGC GGACGGTTTG 

1251 GTTGATTCAA GTGTTGCGCC ACGATTTGCC 

1301 TGAAAGGTTC 

1351 GATAAGTGA 



CGAAGTCGGC GCGTACGCCC 
TCGGCGACAA CAGCGTCGAA 
TGGTTCGCCG CCAAAGACCC 
CGAACGCGCC ACCGTGTTGG 
GCGCTTTATG CAGATGGAAG AAGTGGTCGA GGCATTGGAG 



This corresponds to the amino acid sequence <SEQ ID 252; ORF 081.a>: 

a081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGGRFDA 

51 HDFVEDVLAA GAAAVWSRE DCVAMDGALK VDDTLTALQM LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDN AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNAMR AHVGCGFDGV 

201 GDIAKAKSEI YQGLCSDGMA LIPQEDANMA VFKTATLNLN TRTFGIDSGD 

251 VHAENIVLKP LSCEFDLVCG NECAAWLPV PGRHNVHNAA AAAALSLAAG 

301 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAVDVL 

351 ARM P APR I FV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE 

4 01 AAEKFGADGL WFAAKDPLIQ VLRHDLPERA TVLVKGSRFM QMEEVVEALE 

451 DK* 

m08i/a08i 96.7% identity over a 455 aa overlap 

10 20 30 40 50 60 

m081 .pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 
I I I 1 I I M I I I M I I I I I I I I I I I I I M I M | | I M M I I I t i II II I I I I I M I I I I I 
a081 MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGGRFDAHDFVEDVLAA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m081 . pep GAAAVWSREDCAAMDGALKVDDTLAALQTLAKAWRENVNPFVFGITGSGGKTTVKEMLA 
I I I I I I I I I I I I : I I I I I I I I I I I I : I I I I I II II I I I I I I I I I I II I | | I M I I I I I I 
a081 GAAAVVVSREDCVAMDGALKVDDTLTALQMLAICAWRENVNPFVFGITGSGGKTTVKEMLA 

70 80 90 100 110 120 



130 140 150 160 170 180 

m081 . pep AVLRRRFGDDAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNKFGELAVLTXIAKP 
I I I I I I I I I : M I I II I I I I M | I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I 
a081 AVLRRRFGDNAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTQIAKP 
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130 140 150 160 170 180 

190 200 210 220 230 240 

m081 pep N AALVN N AMRAH VGCG FDG VGD I AKAKS E I YQG LC S DG I AL I PQE D ANMAV FKT AT LNLN 

: I I I I I I I i I ! I I I I I I I I I M II I I I I I I I I I I M I I : i I I I I I I ! I I I I I I I I M ! I I 
a 081 DAALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGMALIPQEDANMAVFKTATLNLN 

190 200 210 220 230 240 

250 260 270 280 290 300 

m081 pep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAVVLPVPGRHNVHNAAAAAALALAAG 
I | M I I I ! I I M I I I I I I I I I I I I I I I I I I = I I I I I 1 M I hi I I I I I I I I I I I I : I I I I 
a 081 TRTFGIDSGDVHAENIVLKPLSCEFDLVCGNECAAVVLPVPGRHNVHNAAAAAALSLAAG 

250 260 270 280 290 300 

310 320 330 340 350 360 

m081 . pep LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV 
I I I M I I I I I I M M I I I I M M I I I I I I I M M I I I M I I I I I I I : I I I M I I I I I I I I 
a 081 LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAVDVLARMPAPRIFV 

310 320 330 340 350 360 



370 380 390 400 410 420 

m08 1 pep MGDMGELGELGEDEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 
MINIMI I I I M I I I I I II I I I I M I I I I I I I I M I I I I I I! I II I I I I 1 I i I I I 

a 081 MGDMGELGE DE AAAMHAE VG A YARDQG IE AAY FVG DN S VE AAEKFGADGLW FAAKDP 

370 380 390 400 410 



430 440 450 

m08 1 . pep LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

I M M M II I I I I I M I I I M I I I I I I M I I I I I I I 
a08 1 LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

420 430 440 450 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 25 3>: 

g082 . seq 

1 aTGTGGTTGT TGAAGTTGCC TGCCGTCGCC GAAACGG CAT CATCGCCGAA 

51 ACGGCGGCGC AATACCGCAG CCAGCATCTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGGT TCACATTATC GCGCCACGCC 

151 TTCGCCAACG TTTGCAACGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 

2 01 GCCCAAAGCC GCGCAATCTT CG CG CGAAAC CACAACCGCC GCCGCGCCCG 

251 CAGACAATAC GCCTCCAACA AAAT CATGCG CGTCAAACCG CCCGCCCGCC 

301 AATGCGAAAA ACACATCGCC TTCCCGAATA TCGCGGCTGT CGGTTACGAT 

351 GCGCGACACG GGTTTGTTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGT TTCATAT TTGCTTTCGT TAATATTCGG 

451 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGCCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGACT GTCTGTCGAT AAAGGCAAGG TTATTGCCTT 

651 CGCCCGGCAC ATCGGGGACA TTCCCCCAAA AAT CAT AG C C GTCATCGGGC 

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 254; ORF 082.ng>: 

g082 . pep 

1 MWLLKLPAVA ETASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTLSRHA 

51 FANVCNAASV SSTFNAPPKA AQSSRETTTA AAPADNTPPT KSCASNRPPA 

101 NAKNTSPSRI SRLSVTMRDT GLFSDGIGSL RAWQMKFRSS GFIFAFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP 

2 01 RRGWGLSVD KGKVIAFARH IGDIPPKIIA VIGQLVGFDT RPTAESA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 255>: 

m082 . seq 

1 ATGnnGTTGT TGAAGTTGCC TGCCGTCGCC AACACGGCAT CATCGCCGAA 

51 ACGGcGGCGC AATACCGCAG C C AG CAT TTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGAT TCACATTTTC ACGCCACGCC 

151 TTTGCCAGCG TTTGCAATGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 
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2 01 ATCCATTGCA GCACAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCAG 
251 C AG C C AAT AC GTCTTCAACA AAATCATGCG CGTCAAACCG CTCGCCCGCC 
301 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT 

3 51 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

4 01 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTACTTTCGT TAATATTCGG 
451 GCGGCGGACA CAT CGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 
501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGsATTT TTTCTGTACG 
551 TAT C ATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 
6 01 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 
651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 
701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 256; ORF 082>: 

m0 82.pep 

1 MXLLKLPAVA NTASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTFSRHA 

51 FASVCNAASV SSTFNAPSIA AQSSRETTTA AAPAANTSST KSCASNRSPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSXF FLYVS FFRRI FSRFAFSRIP 

2 01 RRGWGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 082 shows 92.7% identity over a 247 aa overlap with a predicted ORF (ORF 082 jig) 
from N. gonorrhoeae: 

m082/g082 

10 20 30 40 50 60 

MXLLKLPAVANTASSPKRRRNTAASISFTVVLPPEPVMPNTNGFTFSRHAFASVCNAASV 

I MM M MMMMMM MM MMMMMMM MMM! MMM MM MM I 

MWLLKL. PAVAETAS S P KRRRNTAAS I S FTWL P PE PVMPNTNGFTLS RHAFANV CNAAS V 
10 20 30 40 50 60 

70 80 90 100 110 120 

SSTFNAPS I AAQS SRETTTAAAP AANTSSTKS CASNRS PANAKNTS PARMSRLS VTMRDT 

111111} MIMMMMMM II IIMIIII II 1 1 II I I h h 1 1 I I I II I II 

SSTFNAPPKAAQSSRETTTAAAPADNTPPTKSCASNRPP AN AKNTSP SRI SRLSVTMRDT 
70 80 90 100 110 120 

130 140 150 160 170 180 

GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKHRLFSHSHSXF 

MM M M M Ml M M M M M MM M MMM M MMM M M MMM M MM I 

GLFSDGIGSLRAWQMKFRSSGFIFAFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 
130 140 150 160 170 180 

190 200 210 220 230 240 

FLYVSFFRRI FSRFAFSRI P RRGWGQSVD KG KV I AFALHIGNI PPKI I AV I GQLVGFDT 

II I III I I III Mill I III II MM IIMM 1 1 Ml II Ml I II MM 111 I III 1 1 

FLYVSFFRRIFSRFAFSRIPRRGWGLSVDKGKVIAFARHIGDIPPKIIAVIGQLVGFDT 
190 200 210 220 230 240 



m0 82 .pep 
g082 

m082 -pep 
g082 

m082 .pep 
g082 

m082 -pep 
g082 



m082.pep RPTAESAX 

I M II 1 1 I 

g0 8 2 RPTAESAX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 257>: 

a082 . seq 

1 ATGTGGTTGT TGAAGTTGCC TGCCGTCGCC AAAACGGCAT TATCGCCGAA 
51 ACGGCGGCGC AATACCGCAG CCAACATTTC CTTCACCGTC GTCTTGCCGC 
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101 CCGAGCCGGT AATACCGAAC ACAAACGGGT TCACATTCTC GCGCCACGCC 

151 TTCGCCAACA TTTGCAACGC GGTAAGCGTG TCATCGACTT TCAACGCGCC 

201 ATCCATTGCA ACGCAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCCG 

251 CAGCCAATAC GTCTTCAACA AAATCATGCG CATCAAACCG CCCGCCCGCC 

301 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT 

351 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

401 AGATGAAATT TAGGTCCAGT GGTTTCATAT TTACTTTCGT TAATATTCGG 

4 51 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

501 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG 

551 TATCATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 

601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 

651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 

701 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 



This corresponds to the amino acid sequence <SEQ ID 258; ORF 082. a>: 

a082 .pep 

1 MWLLKLPAVA KTALSPKRRR NTAANISFTV VLPPEPVIPN TNG FT FSRHA 

51 FANICNAVSV SSTFNAPSIA TQSSRETTTA AAPAANTSST KSCASNRPPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFT FVNIR 

151 AADTSVAADF FIACFAVV KH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP 

201 RRGVVGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 

m082/a082 95.5% identity over a 247 aa overlap 



10 20 30 40 50 60 

m082 . pep MXLLKLPAVANTAS S PKRRRNTAAS IS FTWLPPE PVMPNTNG FTFSRHAFAS VCNAAS V 

I Milllll:ll I I I M I I I I I : I I 1 I M I I I I I I : I M I I I I I 1 I I M I :: I I I :! I 
a082 MWLLKLPAVAKTALSPKRRRNTAANISFTVVLPPE PVIPNTNGFTFSRHAFANICNAVSV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m082 . pep SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 
I f I t II I I I I : I I I 1 I I II I ! I I I I I I I M I I I I I I I I I I I I I I I 1 I I I I I M I I I I I I 
a082 SSTFNAPSIATQSSRETTTAAAPAANTSSTKSCASNRPPANAKNTSPARMSRLSVTMRDT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m082 . pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSXF 
I I I I I I I I I II M I II I I I M M M I I I I I I I I I I I I I I I II I II I I I I I M 1 I I I I I I 
a082 GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 

130 140 150 160 170 180 



190 200 210 220 230 24 0 

m082 . pep FLYVSFFRRIFSRFAFSRIPRRGVVGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT 
M I I I I II M I II I I I I It I I I II I I I I I I I M I I I t I I I M I 11 I I I I t I I I I I I t M I 
a 0 8 2 FLYVSFFRRI FSRFAFSRI PRRGVVGQS VDKGKV I AFALHI GNI PPKI I AVI GQLVGFDT 

190 200 210 220 230 240 



mO 8 2 . pep RPTAESAX 
I I I I I I I I 



a082 RPTAESAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 259>: 

g084 . seq 

1 ATGAAa c a AT CCGcccgaat aAAAAATATG GATCAGACAT TAAAAAATAc 

51 attgggcatt tGCGCGcttt tagcctTTTG TTTTggcgcG gccaTCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGC 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GCTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCTT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GC AAT CCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

3 51 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 
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4 01 TTTGGAAATA TTGTGTATCT GTGGGGGTAT TTGCTGACGT AAAAAACTAT 

451 AAACGTCGCA GCAAAATATG G C TG AC CAT A TT AT TG AC T T TGATTTTGTC 

501 CTGCGCGGTG ATGGAGAAAA TCGccggcga taaAGATTGG CGAGaacctg 

551 atgccggcct gttgttgaat ATTTTcgacc tgtattaCga cttggctttc 

601 cgcgccggca cAATATGCCG CCAAGCGCGC CCAcattttg gaagCagcaa 

651 aaaaagcgtC AACATGGCAt atccgccaac ttgcgcccaa gTAtaa 

This corresponds to the amino acid sequence <SEQ ID 260; ORF 084.ng>: 

g084 . pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYRYSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WKYCVS VGVFADVKNY 

151 KRRS KIWLTI LLTLILSCAV ME KIAGDKDW REPDAGLLLN IFDLYYDLAF 

201 RAGT I CRQAR PHFGSSKKSV NMAYPPTCAQ V* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 261 >: 

m084 . seq 

1 ATGAAACAAT CCGCCcGAAT AAAa.ATATG AATCAGACAT TACTTTATAC 

51 ATTGGGCATT TGCGCGCTTT TAACCTTTnn nnnnnnnnnn nnnnnnnnnn 

101 nnnnnTATCA CCCnGAATAT GAATACGGCT ACCGTT AT T C TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

2 01 CGTTTCTTCA GTTGTT TT AC TGATTTACGT CGGCACAACC GCCCTATATT 
251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTT AT C AG AT AGTCGGTTCG 

3 01 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

3 51 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTT CAT TTT T GGCTTGACAG 

4 01 TTTGGAAATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 
4 51 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 
501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATT TG CGAGAACCTG 
551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCT.TC 
601 CGCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 
6 51 AAAAAGCGTC AACATGGCAT ATC CGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ED 262; ORF 084>: 

m0 84 .pep 

1 MKQSARIKXM NQTLLYTLGI CALLiT FXXXX XXXXX YHPEY EYGYR YSAVG 

51 ALASWFLLL LA RGFP RVSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WKYCVS GGVFADVKNY 

151 KRRS KIWLTI LLTLILSCAV MD KIASDKDL REPDAGLLLN I FDLYYDLAX 

201 RAGT I CRQAR PHFGSSKKSV NMAYPS CCAQ V* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 084 shows 90.5% identity over a 231 aa overlap with a predicted ORF (ORF 084.ng) 
from A^. gonorrhoeae: 

m084/g084 

10 20 30 40 50 

m084 . pep MKQSARI KXMNQTLLYTLGI CALLTF YHPEYEYGYRYSAVGALASWFLLL 

MINIM hill IMIIIIhl II 1 ] 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 

g 084 MKQS AR I KNMDQTLKNTLG I CALLAFC FGAAI ASGYHLE YE YG YRYS AVGALAS WFLLL 

10 20 30 40 50 60 

60 70 80 90 100 110 

m084 . pep LARGFPRVSSWLLIYVGTTALYLPVGWLYGAPS YQI VGS I LESNPAEAREFVGNLPGSL 

1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! i II I M I 1 1 i 1 1 1 1 1 M 1 1 1 1 

g084 LARGFPRVSSWLLIYVGTTALYLPVGWLYGAPSYQIVGS I LESNPAEAREFVGNLPGSL 

70 80 90 100 110 120 

120 130 140 150 160 170 

m084 . pep YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 

II I II I II I I I II III MM ! t 1 1 I II I I 1 1 I 1 1 1 I I j M M I I II II M : I I I : 1 1 I 

gO 84 YFVQALFFIFGLTVWKYCVSVGVFADVKNYKRRSKIWLTILLTLILSCAVMEKIAGDKDW 

130 140 150 160 170 180 
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180 190 200 210 220 

m084 . pep RE PDAGLLLN I FDLYYDLAXRAGT I CRQARPHFGS SKKS VNMAYPSCCAQVX 

MM I I MM I M II ; M I II II I I I II I i I Ml I I I MM M i lllll 

g 0 8 4 RE PDAGLLLN I FDLYYDLAFRAGT I CRQARPHFGS SKKS VNMAYP PTCAQVX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 263>: 

a084 .seq 

1 ATGAAACAAT CCGCCCGAAT AAAAAATATG GAT C AG AC AT TAAAAAATAC 

51 ATTGGGCATT TGCGCGCTTT TAGCCTTTTG TTTTGGCGCG GCCATCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTT AT CAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

4 01 TTTGGAGATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

4 51 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCTTCC 

601 . GCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 264; ORF 084.a>: 

a084 .pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYRYSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WRYCVS GGVFADVKNY 

151 KRRSK IWLTI LLTLILSCAV MD KIASDKDL RE PDAGLLLN IFDLYYDLAS 

201 XAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 



m084/a084 92.2% identity over a 231 aa overlap 

10 20 30 40 50 60 

m08 4 .pep MKQSARIKXMNQTLLYTLGICALLTFXXXXXXXXXYHPEYEYGYRYSAVGALASWFLLL 
II I ! I M I 1:111 II I I M I I : i II I I I I I I ! I I I I I I I M I I I I M 

a084 MKQSARIKNMDQTLKNTLGICALLAFCFGAAIASGYHLEYEYGYRYSAVGALASWFLLL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m084 .pep LARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEARE FVGNLPGSL 

I I I I I I i I I I II I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I II I I 1 I I I I I M 1 
a084 LARGFPRVSS VVLLIYVGTTALYLPVGWLYGAPSYQIVGS I LESNPAE ARE FVGNLPGSL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m084 . pep YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 
I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I 
a08 4 YFVQALFFIFGLTVWRYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 

130 140 150 160 170 180 



190 200 210 220 230 

m084 .pep RE PDAGLLLN I FDLYYDLAXRAGT I CRQARPHFGS SKKS VN MAY PSCCAQVX 

I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a084 RE PDAGLLLN I FDLYYDLASXAGT I CRQARPHFGS SKKS VNMAYPSCCAQVX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 265>: 

g085 . seg 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGT TGAAAGATAA 
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51 GGCAAAAGGC GTGTTCCTGA TCGGCGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AACCTGACCG ACTGCGTCAC TTTGGAAGAG 

151 GCGGTTCAGA CGGCATACGC CCAAGCCGAA GCGGGCGATA TTGTCTTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT tatCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 266; ORF 085. ng>: 

g085 .pep 

1 MGKGQDFTPL RDALKDKAKG VFLIGVDAPQ IRRDLDGCGL NLTDCVTLEE 
51 AVQTAYAQAE AGD I VLLS PA CAS FDMFKG Y AHRSEVFIEA FKAL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 267>: 

m085 . seq 

1 ATGGGTAAAG GGCAGGACTT CACGCCCCTG CGCGATGCAC TGGTAGGCAA 

51 GGCAAAAGGC GTGTTCTTGA TTGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AATATGACCG ACTGCGCCAC TTTGGGAGAA 

151 GCCGTTCAGA CGGCATATGC CCAAGCCGAA GCAGGCGATA TTGTGTTGCT 

201 CAGCCCCGCC TGCGCGAGCT TTGATATGTT CAAAGG CT AC GCGCACCGTT 

251 CGGAAGTGTT TATCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 268; ORF 085>: 

m085 -pep 

1 MGKGQDFTPL RDALVGKAKG VFLIGVDAPQ IRRDLDGCGL NMTDCATLGE 
51 AVQTAYAQAE AGD I VLLS PA CASFDMFKGY AHRSEVFIEA FKAL* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 085 shows 94.7% identity over a 94 aa overlap with a predicted ORF (ORF 085. ng) 
from N. gonorrhoeae: 

m085/g085 

10 20 30 40 50 60 

m08 5 pep MGKGQDFTPLRDALVGKAKGVFLI GVDAPQI RRDLDGCGLNMTDCATLGEAVQTAYAQAE 

IMIIIIIIMIII 1 1 i 1 1 1 1 1 1 1 J 1 1 1 1 1 1 i i 1 1 1 1 1 1 = 1 1 1 = 1 1 IMIIIIIIII 

g 0 8 5 MGKGQDFTPLRDALKDKAKGVFLI GVDAPQ IRRDLDGCGLNLTDCVTLEEAVQTAYAQAE 

10 20 30 40 50 60 

70 80 90 

m08 5 . pep AGDI VLLS PACAS FDMFKGYAHRSEVFI EAFKALX 

Ml II MM ! I! II I II I MM M l!ll II III M 

g085 AGD I VLLS PACAS FDMFKGYAHRS E VF I EAFKALX 

70 80 90 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 269>: 

a085. seq 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGC TTGCCGGCAA 

51 GGCAAAAGGC GTGTTCCTGA TCGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGATCTG AATATGACCG ACTGCGCCAC TTTGGAAGAA 

151 GCGGTTCAGA AGGCATATGC CCAAGCCGAA GCGGGCGATA TCGTGCTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT TATCGGGGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 270; ORF 085. a>: 

a085 .pep 

1 MGKGQDFTPL R DA LAG KAKG VFLIGVDAPQ IRRDLDGCDL NMTDCATLEE 
51 AVQKAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIGA FKAL* 

m085/a085 94.7% identity over a 94 aa overlap 

10 20 30 40 50 60 

m085 . pep MGKGQDFTPLRDALVGKAKGVFLI GVDAPQI RRDLDGCGLNMTDCATLGEAVQTAYAQAE 
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I I I It I I I I I I I M : II I I M I I ( I I I I I I M M I i I I I I I I I I I I I I I I I Mill) 
a085 MGKGQDFTPLRDALAGKAKGVFLIGVDAPQIRRDLDGCDLNMTDCATLEEAVQKAYAQAE 

10 20 30 40 50 60 

70 80 90 

m085 . pep AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX 
I I I I I I I I I [ I I I I I I I I M I I I I I I I I I I II I I 
a085 AGDIVLLSPACAS FDMFKGYAHRSEVFIGAFKALX 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 271 >: 

g086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCGGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCGAAGG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GGCAGGCGGG GTTCGTCGTT GCCGGCCTTA TAGCGAGCGG TTTTTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CTTATCCGGC CTGTTGCTGG TAG CCGTATT GATTGCCGGG CGCGAAATCA 

2 51 ATGGCG CG AC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

3 01 GAGCTGTTCA AGCTGGCAGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 
3 51 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 
401 GGCGGGGGAC GGCCAACCTG ATTATGTCCG CCACCAATCC GCAGGCACGT 
451 CGTGAAACAT TAGAAATGTA CGgcCGTTTC CGGGCGATCA TCCTGCCGAT 
501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 
551 GTTCGTTTGT CGTCATTACC GTCATTACCG TTGGAATGCT GTTTCTGGCA 
601 GGATTGCCGT GGAAATATTT TTTTGTCCTG GTAGGCAGCG TCTTGGGTGG 
651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 
701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 
751 CACTCT CTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 
801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCG CAT ACCGATTTTA 
8 51 TTTTTGCCAT CATCGCTGAA GAATTCGGCT TCTTCGGGAT GTGCGTGCTG 
901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 
951 GTCGCGCGAT TTGGGtttgA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGctgCcg t TGATGTCCT ATGGcggTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTAT CGATT 

1151 ATGAAAACCG CCAGAAAATG CGCGGTTACC GGGTGGAGTA AA 

This corresponds to the amino acid sequence <SEQ ID 272; ORF 086.ng>: 

g086 -pep 

1 MWLMTAFGL LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGFLW 

51 FLCRMRTWRR LVPWIFALSG L LLVAVL I A G REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF R AIILPIMLV AFGLVLIMV Q PDFGS FWIT VITVGMLFLA 

201 GLPWKY FFVL VGSVLGGMVL MITA APYRVQ RWAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 

301 I FCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SF FNIGVNIG 

3 51 ALPTKGLTL P LMSYG GSSVF FMLISMMLLL RI DYENRQKM RGYRVE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 273>: 

m086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACc 

301 GAGCTGTTCA AGCtGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGcGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

401 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGrCACGT 

451 CGTGAaACAT TAGAAATGTA CGGCCGTwTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 
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551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGkrTCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA mCAAAgGyCT GACGCyGCCG Tg . AtGTCCw ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTkG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTATC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 274; ORF 086>: 

m086 .pep 

1 MWLMTAFSL LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGLLW 

51 FLC RMRTWRR L VPWIFALSG LLLVWL I A G REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQXR 

151 RETLEMYGRX R AIILPIMLV AFGLVLIMV Q PDFGS FWIT VIAVGMLFLA 

201 GLPW KYFFVL VGSVLGGMVL MIT AAPYRVQ RWAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 

301 IFCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGXQ SF FNIGVNIG 

351 ALPXKGLTX P XMSX GGSSVF FMLISMMLLX RI DYENRRKM RGYRVE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 086 shows 96.7% identity over a 396 aa overlap with a predicted ORF (ORF 086.ng) 
from N. gonorrhoeae: 

m086/g086 

10 20 30 40 50 60 

MVVLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR 

I h III MMMMMMIMIMMMIMIMM I MiMIMI Ml I M M:MI I 

MWLMTAFGLLMIYSASVYLASKEGGDQFFYLTRQAGFWAGLIASGFLWFLCRMRTWRR 
10 20 30 40 50 60 

70 80 90 100 110 120 

LVPWIFALSGLLLVWLIAGREINGATRWIPLGPLNFQPTELFKLAVILiYLASLFTRREE 

MM II II I MM hill II MhlMl! IMIMINMII III 'II II l II llllll ! 

LVPWIFALSGLLLVAVLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE 
70 80 90 100 110 120 

130 140 150 160 170 18 0 

VLRSME S LGWQS I WRGTANL I MS ATNPQXRRETLEM YGRXRAI I L P IMLVAFGLVLIMVQ 

I II MMI IMMI1MII IMIIIIM MIMMII! 1 1 M M M 1 1 M M 1 1 1 1 1 I 

VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ 
130 140 150 160 170 180 

190 200 210 220 23 0 24 0 

PDFGSFWITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK 

1 1 1 1 1 M I 1 1 1 1 : 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 I I 1 1 M I ! 

PDFGSFWITVITVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 
190 200 210 220 230 240 

250 260 270 280 290 300 

DPQGAGYOLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 11 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 i I ! 1 1 1 1 1 1 II 1 1 1 1 1 1 i 1 1 1 1 1 f I 

DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 
250 260 270 280 290 300 

310 320 330 340 350 360 



m086 .pep 
g086 

mO 8 6 . pep 
g086 

m086 -pep 
g086 

m086 . pep 
g086 

mO 8 6 . pep 
g086 
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m08 6 pep IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 

.ill II I IN ! I II I M i I I MM I I I II I I I ! MM I I I I I I II I I M II I M I I I I 

g086 IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP 

310 320 330 340 350 360 



370 380 390 

mO 8 6 . pep XMSXGGSSVFFMLI SMMLLXRIDYENRRKMRGYRVEX 

II I I 1 1 I ! I I I I I 1 I I I IMMIMMIMMI 

g 0 8 6 LMS YGGSSVFFMLI SMMLLLRIDYENRQKMRGYRVEX 

370 380 390 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 275>: 

a086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG G CATC AAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

2 51 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

301 GAGCTGTTCA AGCTGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

4 01 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGGCACGT 

4 51 CGTGAAACAT TAGAAATGTA CGGCCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

7 51 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGCTGCCG TTGATGTCCT ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA T C AG CAT GAT GCTGCTGTTG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTACC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 276; ORF 086. a>: 

a086 .pep 

1 MWLMTAFSL LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGLLW 

51 FLC RMRTWRR L VPWIFALSG LLLVWLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF RA IILPIMLV AFGLVLIMV Q PDFGS FWIT V I AVGML FLA 

201 GLPWKY FFVL VGSVLGGMVL MIT AAPYRVQ RVVAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EF GFFGMCVL 

301 IFCYGWLVVR AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SFFNIGVNIG 

351 ALPTKGLTLP LMSYG GSSVF FMLISMMLLL RI DYENRRKM RGYRVE* 



m086/a086 98.0% identity over a 396 aa overlap 

10 20 30 40 50 60 

m08 6 .pep MVVLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR 
I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I ! I I I I I M I I I M I M I I I I I I I I I I I I I 
a08 6 M VV LMT A FS LLMIYSAS V Y LA SKEGGDQF FY L T RQ AG FW AG L I AS G L LW FL C RMRT WRR 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 6 . pep LVPWIFALSGLLLVWLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE 

I I I I I t I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I r i i I I I I I I I I II I I 

a08 6 LVPWIFALSGLLLVVVLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE 
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70 80 90 100 110 120 

130 140 150 160 170 180 

m086 .pep VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIILPIMLVAFGLVLIMVQ 
M I I I I I I I I I I It I I I ! I I I I I I I I I I I It I I I I I I I I I I M I II I I t I I I I I I I II 
a08 6 VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m08 6 . pep PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK 
I I I II I M I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I | I II 
a08 6 PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m08 6 . pep DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 
I I I M I I M I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I M I I I I || 
a08 6 DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m086 .pep IFCYGWLVVRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 
I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I M I M I I I II I I I I I I I II : I I I I I 
a08 6 IFCYGWLVVRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP 

310 320 330 340 350 360 

370 380 390 

m08 6 . pep XMSXGGSSVFFMLISMMLLXRIDYENRRKMRGYRVEX 
II 1 I I I I I I I II M I I I I I I I I I I I I M I I I I I I 
a 0 8 6 LMS YGG S SVFFMLI SMMLLLR I DYENRRKMRG YRVEX 

370 380 390 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 277>: 

g087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACACAT 

51 TTTCCCAGCT CTGGCTGTGG CGGATTCATT GCGCGTGCGC GGTCATCATG 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA 

151 TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGAATAC GCGGCAACGG 

201 CAT CAAACGC AAGCTGATGC TTCCGTTTAC TCTGTACAAA ACCGTCCGCG 

251 AAGCGCAGCG GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGTTTTG TTACCTTTCC CGGCGGTCTG GCGGCGAAAC TCTTGGGCGT 

351 GCCGATTGTG ATT CACGAG C AAAACGCCGT GGCAGGCTTG TCCAACCGCC 

401 AccTGTCGCg ctGGGCGAAA CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

451 AGCCACGAAG GCGGTTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGCGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGTGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCGT TGGCACTGCT GCCTGAAGAG GTGCGCCCGC AGATGTACCA 

651 CCAGTCGGGG CGTAACAAGC TGGGCAATCT TCAGGCGGAT TATGACGCGT 

701 TGGGCGTGAA AGCGGAATGC GTGGAATTTA TTACCGACAT GGTGTCCGCC 

751 TACCGTGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC 

801 CGAGTTGACG GCGGCGGGGC TGGGCGCGTT GTTAGTGCCG TATCCTCACG 

851 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTCATGGT GCAGGCAGAA 

901 GCGGGGCTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 

951 AATCCTCGGC AGCCTCAACC GCGAAAAATG CCTCAAATGG GCGGAAAACG 

1001 CCCGTACGTT GGCATTGCCG CACAGCGCGG ATGACGTTGC CGAAGCCGCG 

1051 ATTG CGTGTG CGGCGTAAA 

This corresponds to the amino acid sequence <SEQ ID 278; ORF 087.ng>: 

g087 -pep 

1 MGGKTFMLMA GGTGGHIFPA LAVADS LRVR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGIRGNGIKR KLMLPFTLYK TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 
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201 PQALALLPEE VRPQMYHQSG RNKLGNLQAD YDALGVKAEC VEFITDMVSA 
251 YRDADLVICR AG ALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 
3 01 AGLLLPQTQL TAEKLAEILG SLNREKCLKW AENARTLALP HSADDVAEAA 

3 51 IACAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 279>; 

m087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGAwkkCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GGCCATCATG 

101 TGATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGTAT CGTGCCGCAA 

151 TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

2 01 CATCAAACGC AAACTGATGC TGCCGGTTAC TTTGTATCAA ACCGTCCGCG 
251 AAGCGCAGCG GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 
301 GGCGGCTTCG TTACCTTCCC CGGCGGTTTG GCGGCGAAGC TATTAr G CGT 
351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 
401 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

4 51 AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG AT ATT AG CAA 
501 CCTGCCCGTG CCTGCCGAAC GCTT CCAAGG GCGTGAAGGC CGTCTGAAAA 
551 TTTTGGTGGT CGGCGG CAGT TTGGGCGCGG ACGTTTTGAA CAAAAC CGTA 
601 C CGCATGC AT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC ATATGTACCA 
651 CCAATCGGGA CGGGGCAAGC TGGGCATCTT GCAGGCGnnn nnnnnnnnnn 
701 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
751 nnnGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG CGGTTGACGA 
801 TCACCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG GCGGGATTGC 
851 TGTTGCCGCA AACC CAGTTG ACGGCGGAAA AACTCGCCGA GATTCTCGGC 
901 GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG CCCGTACGTT 
951 GGCACTGCCG CACAGTGCGG ACGACGTGGC GGAAGCCGCG ATTG CGTGTG 

1001 CGGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 280; ORF 087>: 

m08 7 .pep 

1 MGGKTFMLXX GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGVRGNGIKR KLMLPVTLYQ TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AAKLLXVPIV I HEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

201 P HALALL PDN ARPHMYHQSG RGKLGILQAX XXXXXXXXXX XXXXXXXXXX 

251 XAGLG ALL V P YPHAVDDHQT ANARFMVQAE AGLLLPQTQL TAEKLAEILG 

3 01 GLNREKCLKW AENARTLALP HSADDVAEAA IACAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 087 shows 83.9% identity over a 355 aa overlap with a predicted ORF (ORF 087.ng) 
from N. gonorrhoeae: 

m087/g087 

10 20 30 40 50 60 

m087 .pep MGGKTFMLXXGGTGGH I FPALAVADSLRARGHHV I WLGSKDSMEERIVPQ YGIRLETLAI 

1 1 1 1 1 It I IMIlMlliMIMMhIIIIIMIIIMMIMIIIIIMIMIlll 

g087 MGGKTFMLMAGGTGGH I FPALAVADSLRVRGHHVI WLGSKDSMEERIVPQ YGIRLETLAI 

10 20 30 40 50 60 

70 80 90 100 110 120 

m0 8 7 . pep KGVRGNGIKRKLMLPVTLYQWREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLXVPIV 

Ihllllllllllll I I I ^ I I I 1 I f E 1 I I I f I f I I I I I 1 I f I I I I I J E I I I I f I MM 
g087 KGIRGNGIKRKLMLPFTLYKTVREAQRI IRKHRVECVIGFGGFVTFPGGLAAKLLGVPIV 

70 80 90 100 110 120 

130 140 150 160 170 180 

m0 8 7 . pep I HEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRAD I SNLPVPAERFQGREG 

MIMMMMMMMM MMMMMMMIMMMMMMMMMIM MMI 

g 0 8 7 IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADI SNLPVPAERFQGREG 

13 0 14 0 150 16 0 170 180 
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190 200 210 220 229 
m087 . pep RLKILWGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQA 

!M I II M I I MM I M II I M Mill M-| I MMI M IM M Ml 

gO 8 7 RLKILWGGSLGADVLNKTVPQALALLPEEVRPQMYHQSGRNKLGNLQADYDALGVKAEC 

190 200 210 220 230 240 



230 240 250 

mO 8 7 . pep AG L GAL LVPY PHA VDDHQT AN AR FMVQAE 

M IMMMIMMM li I Ml II MMI II 

gO 8 7 VEFITDMVSAYRDADLVICRAGALTI AELTAAGLGALLVPYPHAVDDHQTANARFMVQAE 

250 260 270 280 290 300 



260 270 280 290 300 310 

mO 8 7 . pep AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAE7VAI ACAAX 

I I I I I I I I I I I I I I I I I i I h I I I I I I I I I I I I I I I I I I I I I II M M I I I I II II 
g087 AGLLLPQTQLTAEKLAEILGSLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 28 1>: 

a087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GG C CAT CAT G 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAG AG C G CAT CGTGCCGCAA 

151 TACGACATCC TGCTCGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

201 CAT CAAACGC AAGCTGATGC TGCCGTTTAC TTTGTATCAA ACTGTCCGCG 

251 AAGCGCAGCA GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGCTTCG TTACCTTTCC CGGCGGTTTG GCGGCGAAGT TATTAGGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 

4 01 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

4 51 AG C C AC G AAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG AT AT TAG C AA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGTGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC AGATGTACCA 

651 CCAATCGGGA CGGGGCAAGC TGGGCAGCTT GCAGGCGGAT TACGACGCGC 

7 01 TGGGCGTGCA AGCGGAATGC GTGGAATTTA TTACCGATAT GGTGTCCGCC 

7 51 T AC CG C G AT G CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC 

8 01 CGAGTTGACG GCGGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG 
8 51 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG 
901 GCGGGATTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 
951 GATTCTCGGC GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG 

1001 CCCGTACGTT GGCACTGCCG CACAGTGCGG ACGACGTTGC CGAAGCCGCG 

1051 ATTGCGTGTG CGGCGTAA 



This corresponds to the amino acid sequence <SEQ ID 282; ORF 087. a>: 

a087 .pep 

1 MGGKT FMLMA GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YDILLETLAI KGVRGNGIKR KLMLPFTLYQ TVREAQQIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

201 PQALALLPDN ARPQMYHQSG RGKLGSLQAD YDALGVQAEC VEFITDMVSA 

251 YRDADLVICR AG ALTIAELT AAGLGALLV P YPHAVDDHQT AN AR FMVQAE 

301 AGLLLPQTQL TAEKLAEILG GLNREKCLKW AENARTLALP HSADDVAEAA 

351 IACAA* 



m087/aO87 85.4% identity over a 355 aa overlap 

10 20 30 40 50 60 

m087 . pep MGGKTFMLXXGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI 
MINIM I I M M M M M II I M M M I M I I I I II M II I M I I I I I I II I I I 
a087 MGGKTFMLMAGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYDILLETLAI 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m087 . pep KGVRGNGIKRKLMLPVTLYQTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLXVPIV 
I I I I I I I M I I I I I i I I I M I E I I I : M I I II I I I M I I I I I II I I I M I I I I I I I I I 
a087 KGVRGNGIKRKLMLPFTLYQTVREAQQIIRKHRVECVIGFGGFVTFPGGLAAKLLGVPIV 

70 80 90 100 110 120 

130 140 150 160 170 180 

m087 . pep IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 
I I I I I I I I It I I I I I I I I I 1 I M I I I I I I II I I I I I I I I I I I II I I I I I I I M I M I M I 
a0 87 IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGN PVRADISNLPVPAERFQGREG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m087 . pep RLKILVVGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQAXXXXXXXXXXX 

I !! M M II I I f II M I I I J I : I I I I M I I I I I : I I I I I I I I I I I III 
a087 RLKILVVGGSLGADVLNKTVPQALALLPDNARPQMYHQSGRGKLGSLQADYDALGVQAEC 

190 200 210 220 230 240 

250 260 270 280 

m 087 . pep XX XXXXXXXXX AG LGAL LV P Y P H AV D DH QT AN AR FMVQAE 

: : I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

a 087 VEFITDMVSAYRDADLVICRAGALTIAELTAAGLGALLVPYPHAVDDHQTANARFMVQAE 

250 260 270 280 290 300 

290 300 310 320 330 

m087 . pep AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX. 

M I M I I I f I I M I M I I I I I I I I I I I I I I I I I I I I I I II I I I M I 1 I M I I I I I I 
a087 AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 

310 320 330 340 350 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 283>: 



g088 . seq 




1 


ATGTTTTTAT 


51 


TTTTCAATAC 


101 


CGTTTTCCCT 


151 


AAATGCGGGC 


201 


AAACGGCACG 


251 


TGTCCACCCT 


301 


TTGGGCGTAC 


351 


CAAAGTCGTT 


401 


TGTGGCAGTC 


451 


gCcgcCAATT 


501 


CGCCCTGCCG 


551 


TCGTCGGCAC 


601 


GCCGCcttCC 


651 


CGTCAGCGGA 


701 


CCGGCGCGAA 


751 


CTCGGATTTT 


801 


TGTCGGCGCG 


851 


t CCGCCAAGA 


901 


gccgtgTCCG 


951 


CATCTTcCTg 


1001 


aagaaacgca 


1051 


tTgataggtt 


1101 


ACCTTTCAGA 



CTGGCATTGG GTGCCGCGCT CGGTaccGtt gCCGTcaTcg 



3CATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 284; ORF 088.ng>: 

g088 .pep 

1 MFLWLAHFSN WLTGLNI FQY TTFRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLIiATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSANNILI VPFFKQIALP LGWGFLVLS YLTIVGTSNA VNLTDGLDGL 
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201 AAFPFVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGANE VA I FCTAMCGAC 

251 LGFLW FNAYP AQ V FMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

301 AVSVM LHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIWV 

351 LIGL STLKIR GNYAVRTPFR RHLNAQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 285>: 

m0 88 . seq 



1 


ATGTTTTTAT 


GGCTCGCACA 


TTTCAGCAnC 


TGGTTAACCG 


GTCTGAATnn 


51 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


101 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


151 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


201 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


251 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


301 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


351 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


401 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


451 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


501 


nnnnnnnnnn 


nnnGGCGTGG 


TCGGCTTTTT 


GGTGTTGTCT 


TACCTGACCA 


551 


TCGTCGGCAC 


ATCCAATGCC 


GTCAACCTCA 


CCGACGGCTT 


GGACGGCCTT 


601 


GCGACCTTCC 


CCGTCGTCCT 


CGTTGCCGCC 


GGCCTCGCCA 


TCTTCGCCTA 


651 


TGCCAGCGGC 


CACTCACAAT 


TTGCC CAATA 


CCTGCAATTA 


CCTTACGTTG 


701 


CCGGCGCAAA 


CGAAGTGGTG 


ATTTT CTGTA 


CCGCCATGTG 


CGGCGCGTGC 


751 


CTCGGTTTCT 


TGTGGTTTAA 


CGCCTATCCC 


GCGCAAGTCT 


TTATGGGCGA 


801 


TGTCGGTGCA 


TTGGCATTGG 


GTGCCGCGCT 


CGGTACCGTC 


GCCGTTATCG 


851 


TCCGCCAAGA 


GTTTGTCCTC 


GTCATTATGG 


GCGGATTATT 


TGTCGTAGAA 


901 


GCCGTATCCG 


TTATGCTTCA 


GGTTGGCTGG 


TATAAGAAAA 


CCAAAAAACG 


951 


CATCTTCCTG 


ATGGCGCCCA 


TCCATCACCA 


C TACGAACAA 


AAAGGCTGGA 


1001 


AAGAAAC CCA 


AGTCGTCGTC 


CGCTTTTGGA 


T TAT T AC CAT 


CGTCTTGGTG 


1051 


TTGATCGGTT 


TGAGTACCCT 


CAAAATCCGC 


TGAACCTATG 


CCGTCTGAAC 


1101 


ATCTTTCAGA 


CGG CATTTGA 


ACGCGCAATA 


A 





1 MFLWLAHFSN WLTGLNIFQY TTFRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANP YIWIL 

101 LGVLIiATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSAN NILI VPFFKQI ALP LGWGFLVLS YLT I V GTSN A VNLTDGLDGL 

201 AAFPFVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGANE VA I FCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

3 01 AVSVM LHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIWV 

3 51 LIGLSTLKIR GNYAVRTPFR RHLNAQ* 



This corresponds to the amino acid sequence <SEQ ID 286; ORF 088>: 

m0 8 8 . pep 

1 MFLWLAHFSX WLTGLNXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

101 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPWLVAA GLAI FA YASG HSQFAQYLQL PYVAGANE W I FCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

301 AVSVM LQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQ WV RFWIITIVLV 

3 51 LIGL STLKIR XT YAVXTS FR RHLNAQ* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 088 shows 91 .7% identity over a 205 aa overlap with a predicted ORF (ORF 088.ng) 
from N. gonorrhoeae: 

m088/g088 



10 20 30 

m0 8 8 . pep GWGFLVLS YLTI VGTSNAVNLTDGLDGLA 

I I II I I il ill I I I IN I II Ml II I I M i 

g0 8 8 IAGLALFYLAANSANNI LI VPFFKQIALP LGWGFLVLS YLTI VGTSNAVNLTDGLDGLA 

150 160 170 180 190 200 
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40 50 60 70 80 90 

m0 8 8 . pep T F PWLVAAG LA I FAYASGHSQFAQYLQLPYVAGANEWI FCTAMCGACLGFLWFNAYPA 

Ml ! I I M I! ! Ill h!M I I = 1 1 f I I ! I I I I ( 1 1 i ^ I E 1 1 f I 1 1 I 1 1 I i 1 I t I 1 1 1 1 

g088 AFPFVLVAAGLAI FAYVSGHYQFSQYLQLPYVAGANEVAI FCTAMCGACLGFLWFNAYPA 

210 220 230 240 250 260 



100 110 120 130 140 150 

mO 8 8 . pep QV FMGD VG ALALGAALGTV AV I VRQE FVLV I MGGLFWE AVS VMLQVGW YKKT KKR I FLM 

II I IIIIM IMMII IIIIMMIMI .1 illillll II IMIhMMM II ! INI 

gO 8 8 QVFMGDVGALALGAALGTVAVI VRQEFVLVIMGGLFWEAVSVMLHVGWYKKTKKRI FLT 

270 280 290 300 310 320 



160 170 180 190 200 

m088 . pep APIHHHYEQKGWKETQVWRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 

llllllh : I II I I Ml li I II I: Ml I 1 II I II M :||| I lllllllll 
gO 8 8 AP IHHHYQLRCWKETQVWRFWI ITI VWLIGLSTLKI RGNYAVRTP FRRHLNAQX 

330 340 350 360 370 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 287>: 

a088 . seq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTCAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGT CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

251 TGTCCACCCT GTTGTGGGGC AACTGGGCAA ACCCGTATAT CTGGATTCTC 

301 TTGGGCGTAT TGCTCGCCAC GGGCGCACTC GGTTTTTACG ACGACTGGCG 

351 CAAAGTCGTC TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAATGG 

4 01 TGTGGCAGTC AAGCGTTGCC ATTATCGCCG GTTTGGCATT GTTTTACCTT 

4 51 GCCGCCAATT CCGCCAACAA TATTTTGATT GTCCCGTTCT TCAAACAAAT 

501 CGCCCTGCCG CTGGGCGTGG TCGGCTTTTT GGTGTTGTCT TACCTGACCA 

551 TCGTCGGCAC ATCCAATGCC GTCAACCTCA CCGACGGCTT GGACGGCCTT 

601 GCGACCTTCC CCGTCGTCCT CGTTGCCGCC GGCCTCGCCA TCTTCGCCTA 

651 TGCCAGCGGC C ACT C ACAAT TTGCCCAATA CCTGCAATTA CCTTACGTTG 

701 CCGGCGCAAA CGAAGTGGTG ATTTTCTGTA CCGCCATGTG CGGCGCGTGC 

7 51 CTCGGTTTCT TGTGGTTTAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 

801 TGTCGGTGCA TTGGCATTGG GTGCCGCGCT CGGTACCGTC GCCGTCATCG 

851 TCCGCCAAGA GTTTGTCCTC GTCATTATGG GCGGATTATT TGTCGTAGAA 

901 GCCGTATCCG TTATGCTTCA GGTCGGCTGG TATAAGAAAA CCAAAAAACG 

951 CATCTTCCTG ATGGCGCCCA TCCATCACCA CTACGAACAA AAAGGCTGGA 

1001 AAGAAACCCA AGTCGTCGTC CGCTTTTGGA T TAT TAG CAT CGTCTTGGTG 

1051 TTGATCGGTT TGAGTACCCT CAAAATCCGC TGAACCTATG CCGTCTGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 288; ORF 088.a>: 

a088 .pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLLATGAL G F YD DWRKVV YKDPNGVSAK FKMVWQSS VA IIAGLALFYL 

151 AANSA NNILI VPFFKQI ALP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPVVLVAA GLAIFA YASG HSQFAQYLQL PYVAGANE VV IFCTAMCGAC 

251 LGFLWFNAYP AQVFMGD VGA LALGAALGTV AVI V RQE FVL VIMGGLFWE 

301 AVSVMLQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQ VVV RFWIITIVLV 

351 LIGLSTLKIR *TYAV*TPFR RHLNAQ* 



m088/a088 99.5% identity over a 205 aa overlap 



150 160 170 180 190 200 

IT1088 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGWGFLVLSYLTIVGTSNAVNLTDGLDGLA 

I i I I I I I I I I II I I I i i I I I I II I I I I I I I 
a088 IAGLALFYLAAN SANNILIVPFFKQIALPLGVVGFLVLSYLTIVGTSNAVNLTDGLDGLA 
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150 160 170 180 190 200 

210 220 230 240 250 260 

m088 .pep TFPVVLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEWIFCTAMCGACLGFLWFNAYPA 
I I I I i j I I I t I I I I I I I I i I I I I I II I I I M I I I I I I I I I I I I I M M M J I I J I I j I I I 
a08 8 TFPVVLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEVVIFCTAMCGACLGFLWFNAYPA 

210 220 .230 '240 250 260 

270 280 290 300 310 320 

m088 . pep QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFVVEAVSVMLQVGWYKKTKKRIFLM 
I I I I I I I I I I M I I I I I I I I I I I I I I I M II I I I I II I I I I ! I I I II I I I I M II I I I M 
a088 QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFWEAVSVMLQVGWYKKTKKRIFLM 

270 280 290 300 310 320 

330 340 350 360 370 

m088 .pep APIHHHYEQKGWKETQVVVRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M M I I I I I I I I I I I I I I I I M I II 
a08 8 APIHHHYEQKGWKETQVVVRFWIITIVLVLIGLSTLKIRXTYAVXTPFRRHLNAQX 

330 340 350 360 370 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 289>: 

g089 . seq 

1 ATGCCGCCCA AAATCACGAA GAGCGGGTTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCG C CG AC ATT CG TGCCTTTGCT GTCGTCGATG AATACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACACGGT GCGGCAAGCC TTGGAAGGTT 

151 TTGACGTGTT CCAGCAATGC TTCGCGCGGC AAACCGACGG CCTCGCACAA 

201 AGCCACGGCA GCCATAACGT TGGCGGCGTT GTGCAAACCT TGCAGCGGGA 

251 TGTCTTGCGT AGAAATCAAA TCTTCATTGC CTTGTTTTAA ACAGCCCGTC 

3 01 CCGCGTTCCA ACCAAAAATC GGCTTCGTGT TCCAAGGAAA ACCGTTTCAC 

351 TTCACGCCCT GCCCGTTTCA TGGCGCGGCA GAACACGTCG TCCGCATTCA 

401 AAAC CTGCAC TCCATCGCCA CGGAAAATCT CGGCTTTGGT ATG CGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 290; ORF 089.ng>: 

gO 8 9 -pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGKPWKV 

51 LTCSSNASRG KPTASHKATA AITLAALCKP CSGMSCVEIK SSLPCFKQPV 

101 PRSNQKSASC SKENRFTSRP ARFMARQNTS SAFKTCTPSP RKISALVCA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 291>: 

m089 . seq 

1 ATGCCGCCCA AAATCACkAw GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCAGTCGCG C CG AC ATT CG TGCCTTTGCT GTCGTCGATA AACACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGGAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAAC CGATGG CCTCACACAA 

201 AGCCACkGCA GCCATGACGT TAG CGGCGTT GTGCAkACCT TGCAACGGwA 

251 TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTT C AG GCGGCCTGTC 

3 01 TCGCGTTCCA ACCAGAAATC AGCT TCGTGT TCCAACGAAA ACCATTTTAC 

3 51 CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG TCCGCATTCA 

4 01 AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCATAG 

This corresponds to the amino acid sequence <SEQ ID 292; ORF 089>: 

1X1089 .pep 

1 MPPKITXSGF CKP AIAAAVA PTFVPLLSSI NTTPFFSPIF STRCGRPWKV 

51 LTCSSNASRD KPMASHKATA AMTLAALCX P CNGMSCVTIK SSLPCFRRPV 

101 SRSNQKSASC SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 089 shows 88.6% identity over a 149 aa overlap with a predicted ORF (ORF 089.ng) 
from N. gonorrhoeae: 

m089/g089 
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10 20 30 40 50 60 

m08 9 .pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

llllll I I I I I IMM MIMIMI I II Ml I I M Ml M MM M I M Ml I I I I I I 

g08 9 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGKPWKVLTCSSNASRG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 9 . pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 

II IIIIMIIMIMM IIMIIII I I M i I I ! : : I I I I I I II M I I : M : I I I I I 
g 0 8 9 KPTASHKATAAI TLAALCKPCSGMS CVE I KS SLPC FKQPVPRSNQKS AS CS KENRFTS RP 

70 80 90 100 110 120 



130 140 150 

m08 9 . pep ARF I ARQNASS AFKTCTPS PRKI LALVCAX 

IIIMIMMIIIMIMIIIII MINI 

g 0 8 9 ARFMARQNTS S AFKTCTPS PRKI SALVCAX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 293>: 

a089. seq 

1 ATGCCGCCTA AAATCACGAA GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCA CCGACGTTCG TGCCTTTGCT GTCGTCGATG AACACCACGC 

101 CATTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGAAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGGC AAACCGACGG CTTCGCACAA 

201 GGCAACGGCA GCCATCACGT TAGTGGCGTT GTGCAAGCCT TGCAGCGGAA 

251 TATCTTGCGT GGCAATCAAA TCTTCATTGC CTTGTTTCAG GCGACCTGTC 

301 TCACGTTCCA ACCAAAAATC GGCTTCGTAT TCCAACGAAA ACCATTTCAC 

351 CTCGCGCCCG GCGCGCTTCA TCGCACGACA GAACGCATCG TCCGCATTCA 

4 01 AAACCTGCAC ACCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 294; ORF 089.a>: 

a089.pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGRP*KV 

51 LTCSSNASRG KPTASHKATA AITLVALCKP CSGISCVAIK SSLPCFRRPV 

101 SRSNQKSASY SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 



m089/a089 91 .9% identity over a 149 aa overlap 

10 20 30 40 50 60 

m08 9 . pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

llllll II t I I I I M M I I I I I I I I I I I : I I I I I I I I I I I I I I II I I I I I I I I I I I I 
a08 9 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGRPXKVLTCSSNASRG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 9 . pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 
II I I M I I I I : I I : I II I I : I : I I I : I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I 

a08 9 KPTASHKATAAITLVALCKPCSGISCVAIKSSLPCFRRPVSRSNQKSASYSNENHFTSRP 

70 80 90 100 110 120 



130 140 150 

m08 9 . pep ARFIARQNAS SAFKTCT PS PRKI LALVCAX 

I I I I II I I I I I I I I I I I M I I I II I I I M I 
a08 9 ARF I ARQNASS AFKTCTPS PRKI LALVCAX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 295>: 

g090 . seq 

1 ATGCGCGTAG TCGAGCAAAT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 
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51 TGTTCATCAC CGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCcca CACCCACACG TCCGCCTTTT TGCCTTCgcg 

151 ctgCAATtct gcctccaaga cgggcgtacc gatATTGCCC GCAATGAcgg 

201 tatccagccc gcacttgatg CAGAGatagc ggaccaggct ggttaccgTG 

251 GTTttgccgt tgctgCcggt aatcgCaatc accttgtcgC CGCGGCGGtt 

3 01 cAcaaTGTCc gccaGCAATt ggATGTCGCC TAgCACGCGC . ccgccgTTT 

351 TGCttga 

This corresponds to the amino acid sequence <SEQ ID 296; ORF 090.ng>: 

g090 -pep 

1 MRWEQIWA VEMV FGNVHH RRRSRAQAFG VFQLEAGKLP HPHVRLFAFA 

51 LQFCLQDGRT DIARNDGIQP ALDAEIADQA GYRG FAVAAG NRNHLVAAAV 

101 HNVRQQLDVA XHAXRRFA* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 297>: 

m090.seq 

1 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

2 01 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 
251 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT 

3 01 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT .CCGCCGTTT 
3 51 TGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 298; ORF 090>: 

m090 .pep 

1 MRIVEQWVA VEMV FGNVQH RRRS RTQAFG VFQLEAGKLQ HPHVRLFAFA 

51 LPFRLQNRRA DIARDNGIQP ALDTE I ADQA RYRGFAVAAG NRNYLWPAV 

101 HNVRQQ FD V A QHAXRRFA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 090 shows 83.9% identity over a 1 18 aa overlap with a predicted ORF (ORF 090.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

MRIVEQVWAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 
I h M hill II I ill M HI II I hill 'I'lill li I IIIIIIIMM | ||: |: 
MRWEQIWAVEMVFGNVHHRRRSRAQAFGVFQLEAGKLPHPHVRLFAFALQFCLQDGRT 
10 20 30 40 50 60 

70 80 90 100 110 119 

D IARDNG I QP ALDTE I ADQARY RGFAVAAGNRN Y LWPAVHNVRQQFDVAQHAXRRFAX 

I II hh I h ! I hil 1 1 I I lllllllllllhlh lllllllhlll lllllill 

DI ARNDGI QPALDAE I ADQAGYRGFAVAAGNRNHLVAAAVHNVRQQLDVAXHAXRRFAX 
70 80 90 100 110 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 299>: 

a090.seq 

1 ATGCGCGTAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC TGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAACTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCAATTCC GCCTCCAAAA CCGGCGCGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC ACACTTGATG CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATC ACCTTGTCGC CGCGGCGGTT 

301 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTT 

351 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 300; ORF 090.a>: 



m090/g090 

m090 . pep 
g090 

m090 .pep 
g090 
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at) 90 .pep 

1 
51 
101 



MRVVEQVVVA VEMVFGNVQH CRRSRAQAFG VFQLETGKLQ HPHVRLFAFA 
LQFRLQNRRA DIARDNGIQP TLDAEIADQA RYRGFAVAAG NRNHLVAAAV 
HNVRQQFDVA QHAXRRFA* 



m09/a090 91.5% identity over a 1 17 aa overlap 

10 20 30 40 50 60 

m090 . pep MRIVEQVVVAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 
M : I I M I I t I I I I I M I M I M I : I ! I I I! I II : I I I I I I I I I I I I I M I I I I I I I I 
a090 MRVVEQVVVAVEMVFGNVQHCRRSRAQAFGVFQLETGKLQHPHVRLFAFALQFRLQNRRA 

10 20 30 40 50 60 



70 80 90 100 110 119 

m090 . pep DIARDNGIQPALDTEIADQARYRGFAVAAGNRNYLVVPAVHNVRQQFDVAQHAXRRFAX 
I I I I M 1 I M : I I : I I I I I I I t I M ) I I I I I I I : I I : I I 1 I I II I I I I I I I it I I I I I 
a 090 DI AR DNG I Q PT LD AE I ADQAR YRG FAVAAGNRNHLVAAAVHN VRQQ F DVAQH AXRR FAX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae 

g090-l.seq This sequence contains multiple stop codons (not shown) 

This corresponds to the amino acid sequence < ORF 090-1 .ng>: 

gO 90-1. pep (not shown) 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 2>: 



m090-l . seq 

1 ATGACGGCGT 



51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 



TAAACACTTC 
CAGGCGGGGC 
CGTCTGAACG 
AGCCGTCCAA 
ACGCCTTTTG 
CCCTTTACCC 
CAATCGCGGC 
AACACCACGC 
TTGGAAGGTT 
CCTCACACAA 
TGCAACGGAA 
GCGGCCTGTC 
ACCATTTTAC 
TCCGCATTCA 
ATGCGCATAG 
TGTTCAGCAC 
TGGAAGCTGG 
CTGCCATTCC 
TATCCAGCCC 
GTTTTGCCGT 
CACAATGTCC 
GCTTGAACGC 
GCCAGAATAT 
AAACACCAAC 
TCAGCTCCGC 
GCAATCATGG 
GTTTTGAAAA 



TTGCATTTCA 
CGAACGGTGC 
TGAGCAACAC 
GCTTCTCCCA 
ATCGCGGCGG 
CCTTGCCTAC 
ATGCCGCCCA 
GGCAGTCGCG 
CGTTTTTCTC 
TTGACGTGTT 
AGCCACGGCA 
TGTCTTGCGT 
TCGCGTTCCA 
CTCGCGCCCG 
AAACCTGCAC 
TCGAGCAAGT 
CGTCGCCGCA 
AAAGCTCCAA 
GCCTCCAAAA 
GCACTTGATA 
TGCTGCCGGT 
GCCAGCAATT 
CTCAATATCC 
CGAAACCGTT 
CCGTCAAACA 
ATCATACGCA 
AAATACCCGT 
GTCATTTTGG 



GACGGCATCA 
GCGTAGCCTT 
AATATCGCCT 
AAGTGGCGCA 
ATTTGCGGCG 
CAGTGCATCG 
AAATCACGAA 
CCGACATTCG 
GCCGATTTTT 
CGAGCAATGC 
GCCATGACGT 
GACAAT C AAA 
AC C AG AAAT C 
GCACGCTTCA 
GCCGTCGCCA 
CGTCGTAGCG 
GTCGGACGCA 
CACCCACACG 
CCGGCGTGCC 
CAGAGATAGC 
AATCGCAATT 
CGATGTCGCC 
GGCTGCCGCT 
GTCCAGCGCA 
TCTTACCGAT 
GCAACCTCCG 
ACCGCCGAGT 
TTTGTCCTAA 



CAAAGCCTTA 
TGAACATATC 
GCTTCGGCTT 
GTCGGTCATA 
CATCGACACC 
CGCAGGGGCG 
GAGCGGATTT 
TGCCTTTGCT 
TCCACGCGGT 
TTCGCGCGAC 
TAGCGGCGTT 
TCTTCATTGC 
AGCTTCGTGT 
TCGCGCGGCA 
CGGAAAATCT 
GTCGAGATGG 
GGCTTTCGGT 
TCCGCCTTTT 
GATATTGCCC 
CGACCAGGCT 
ACCTTGTCGT 
CAACACGCGT 
CGCTGATGCC 
TCTTTCAGAC 
TTGCGACACG 
CGCCGTTTTT 
CCGGCGACGA 



AACGCTTCGA 
AAAGCTCGCG 
GGGCATATGC 
TTCAAGCCGC 
AAT C AAGAAC 
TGAAGTCCTG 
TGCAAACCGG 
GTCGTCGATA 
GCGGCAGGCC 
AAACCGATGG 
GTGCAGACCT 
CTTGTTTCAG 
TCCAACGAAA 
GAACGCATCG 
TGGCTTTGGT 
TCTTCGGAAA 
GTTTTCCAGT 
TGCCTTCGCG 
GCGATAACGG 
CGTTACCGTG 
CCCGGCGGTT 
CCGCCGTTTT 
GGGACTGAGA 
GGCCCGTGTA 
CGTTCCGGCT 
GCGCAGGTAG 
GGATTTTTTT 



This corresponds to the amino acid sequence <SEQ ID 3; ORF 090- 1>: 

m0 9 0 - 1 .pep 

1 MTAFAFQTAS QSLKRFDKHF RTVRVAFEHI KARAGGAEQH NIACFGLGIC 

51 RLNGFSQSGA VGHIQAAAVQ IAADLRRIDT NQEHAFCLAY QCIAQGREVL 

101 PFTHAAQNHE ERILQTGNRG GSRADIRAFA VVDKHHAVFL ADFFHAVRQA 

151 LEGFDVFEQC FARQTDGLTQ SHGSHDVSGV VQTLQRNVLR DN QIFIALFQ 

201 AACLAFQPEI SFVFQRKPFY LAPGTLHRAA ERIVRIQNLH AVATENLGFG 

251 MRIVEQVVVA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 

301 LP FRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLVVPAV 

3 51 HNVRQQFDVA QHASAVLLER LNIRLPLADA GTESQNIETV VQRI FQTARV 

4 01 KHQPVKHLTD LRHAFRLQLR IIRSNLRAVF AQVGNHGNTR TAESGDEDFF 
4 51 VLKSHFGLS* 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 303>: 

g091 . seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAGTCATTTT GGTTTTGTCC TAAAACAAAT CATATTGGGC AGGAGACGTC 

101 CGCCCTTGCC CAAGCCGCTT TCAGACGG C A TCGCGAGCCG ATTAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCT GTTTTGGTCT C CGTTTT G AC 

201 AAGCCTTGCC AAGCCATTGT TGAGCGAGCG CAAGGTCTTG GCGCACGCCG . 

251 CGTCCATCGT AATACATCAA GCCCAAATTG TATTGGGCTT GGGCATCCCC 

3 01 TTGTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 304; ORF 091.ng>: 

g091 . pep 

1 MEIPVPPSPA TRIFLFESHF GFVLKQTILG RRRPPLPKPL SDGIASRLIT 
51 RLQ ALVIVAA VLVSVLTSLA KPLLSERKVL AHAAS IVIHQ AQIVLGLGIP 
101 LF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 305>: 

m091 . seq 

1 ATGGAAATAC CCGTACCGCC GAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAAGTCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGAGATGTC 

101 CGCCCCTGCC CAAGCCGCTT TCAGACGG C A TCGCGAGCTG TTCAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCC GTCTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAACCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGCTT GGGCTACCCC 

301 CTGCGC. . . 

This corresponds to the amino acid sequence <SEQ ID 306; ORF 091>: 

m0 9 1 . pep 

1 MEIPVPPSPA TRIFLFEKSF WFVLKQIILS RRCPPLPKPL SDGIASCSIT 
51 RLQ ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASF GIHH AQIVLGLGYP 
101 LR. 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 091 shows 84.2% identity over a 101 aa overlap with a predicted ORF (ORF 091.ng) 
from N. gonorrhoeae: 

m091/g091 



10 20 30 40 50 60 

m091 . pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 

MIMMMMMMIM I II 1 1 1 1 ! I ^ I I 1 1 1 I 1 1 I M 1 1 1 1 MINIMUM 

g091 MEIPVPPSPATRIFLFESHFGFVLKQIILGRRRPPLPKPLSDGIASRLITRLQALVIVAA 

10 20 30 40 50 60 



70 80 90 100 

m0 9 1 . pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR 

MMIMIIIIMI : 1111111= IhlllllMI M 

g0 91 VLVSVLTSLAKPLLSERKVLAHAASIVIHQAQIVLGLGIPLFX 

70 80 90 100 



The following partial DNA sequence was identified in N. meningiditis <SEQ ID 307>: 

a091 . seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTTG 

51 G AAAT CAT T T TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGGGATGTC 

101 TGATCCTGCT CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCTTTCAGG CGTTGGTCAT TGTCGCAGCT GTCTTGGTAT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGC 
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This corresponds to the amino acid sequence <SEQ ID 308; ORF 091. a>: 

a091 .pep 

1 MEIPVPPSPA TRIFLFWKSF WFVLKQIILS RGCLILLKPL SDGIASCSIT 
51 RFQ ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFGIHH AQIVLG 

m09i/a09i 93.8% identity over a 96 aa overlap 

10 20 30 40 50 60 

m091 .pep MEIPVPPSPATRI FLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 

| I M I I II f I I I M I I I I M I I i I 1 I I I I I I i I I I I I II I I 1 I I I I : II I ! M I I 
a 0 91 MEIPVPPSPATRI FLFWKSFWFVLKQIILSRGCLILLKPLSDGIASCSITRFQALVIVAA 

10 20 30 40 50 60 

70 80 90 100 

m0 91 .pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR 

I I I I I I I I I M I I I I M I I I I II I I I M II I I I I I I 
a0 91 VLVSVLTSLAKPFLCKGAVLAHAAS FGIHHAQIVLG 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 309>: 

g092 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGTGCGC 

51 AAACGGTCAG AC CTTTAAAA TAACGCCTTT ACGCACTAAA AACCAACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTAA GCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGAGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 

3 01 AACGGTgcgg ATGTCGTCGT TGCCTCTACC GCCGTCAAGA AAGAAa atcC 
351 CGAAGTtgtc gcTGCGTTGG AGCGGCAAAT TCCCGTTATT CCGCGCGCCT 

4 01 TGATGCTGGC AGAG CTGATG CGCTTCCGTG ACGgcatcgc cattgccggT 
4 51 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 
501 GG CAGG ACT C GACCCCACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 
551 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 
601 GAATCCGATG CCTCTTTCCT ACATCTGACC CCGATTATGT CCGT CGTTAC 
651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 
701 TG CAT CAGG C GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 
751 GCCTTTTTGT GTGTTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 
801 GAGCAAACCT TATGCT AC TT ACGGTTTGGA CGATACCGCC GACATCTACG 
851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATT C AC CGTCCATGTT 
901 CAAATGAAAG GACATGAG C A GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
951 CGG CAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGc gtggcGCTgg 

1001 aagtcGgCGC ATcggttgAA GCGAtcCAAA AaggCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATAcggc gacatCAagt tgccaaacgg 

1101 cggGaccgCT TTgctGGTGG ACGATTAcgg ACACCACCCC GTCGAAATGG 

1151 CGGcaaccct tgccgcTGCA CGCGGCGCGT AT C CGGAAAA acgtTTGGTG 

1201 CtcgCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTACTCAATA CCGTTGatgC GCTGGTACTG ACCGAAGTTT 

13 01 AtgccgccgG CGAAGAGCCG GTTGCCGCCG CCGactcCCG CGCCTTGGCG 

13 51 CGTGCTATCC GCGTATTGGG CAAACTTGAG CCGATTTACT GCGAAAatgt 
1401 cgccgACCTG CCGCAAATGC TGATGAATGT TTTACAGGAT Ggcgatgttg 

14 51 tgttgAATAT GggTgcggga agcatcaacc gcgttccttc cgcgctgttg 

15 01 gaattgtcga AACAGAtttg A 

This corresponds to the amino acid sequence <SEQ ID 310; ORF 092.ng>: 

g092 .pep 

1 MFFISIRYIF VRKLWCANGQ TFKITPLRTK NQPERNIMMK NRVSNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLSSLGIQ VYPGHTAEHV 

101 NG AD VW AS T AVKKENPEW AALERQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TAS I LGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDAS FLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 
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251 AFLCVDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLN ALAAIG VALEVGASVE AI QKGLLGFE 

3 51 GVGRRFQKYG DI KLPNGGTA LLVDDYGHHP VEMAATIiAAA RGAYPEKRLV 

4 01 LiAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP VAAADSRALA 
4 51 RAIRVLGKLE PI YCENVADL PQMLMNVLQD GDWLNMGAG SINRVPSALL 
501 ELSKQI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 31 1 >: 

m0 92 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TAT G ATG AAA AAT CGAGTT A CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGCATCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAgGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC CGAACACGTT 

3 01 AACGGTGCGG ATGTCGTCGT TACCTCTACC GCCGTCAAAA AAGAAAATCC 

3 51 CGAAGTTGTC GCTGCGTTGG AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 

4 01 TGATGTTGGC GGAGTTGATG CGCTTCCGTG ACGGCATCGC CATTGCCGGC 

4 51 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

5 01 GGCAGGACTT GACCCGACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

5 51 GCACTAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

6 01 GAGTCGGATG CATCCTTTCT GCACCTGACA CCGATTATGT CCGTCGTTAC 
6 51 CAATATCGAC GAAGAC C ATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 
701 TGCATCAGGC GTTTAT CGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 
751 GCCTTTTTGT GTATTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 
801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 
851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 
901 CAAATGAAAG G ACAT GAG C A GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGC GTGGCGCTGG 

1001 AAGTCGGCGC AT CGGTTGAA GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATACGGC GACATCAAGT TGCCAAACGG 

1101 CGGGACCGCG CTCTTGGTGG ACGACTACGG ACACCACCCC GTCGAAATGG 

1151 CGGCGACCCT TGCCGCCGCA CGCGGCGCGT AT CTGGAAAA ACGTTTGGTA 

1201 CTCGCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTCCTCAATA CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 

13 01 ATGCCGCCGG TGAAGAGCCG ATTGCCGCCG CCGATTCCCG CGCTCTTGCC 

13 51 CGCGCCATCC GCGTGTTGGG CAAACTCGAG CCGATTTACT GCGAAAACGT 
1401 TGCCGATCTG CCCGAAATGC TGTTGAACGT TTTGCAGGAC GGCGACATCG 

14 51 TGTTGAATAT GGGCGCGGGA AGCATCAACC GCGTCCCCGC CGCGCTGCTG 
1501 GCATTGTCGA AACAGATTTG A 

This corresponds to the amino acid sequence <SEQ ID 3 12; ORF 092>: 

m092 .pep 

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VY PGHTAEHV 

101 NGADVWTST AVKKENPEW AALEQQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TAS I LGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLN ALAAIG VALEVGASVE AI QKGLLGFE 

3 51 GVGRRFQKYG DI KLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYLEKRLV 

4 01 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA 
451 RAIRVLGKLE P I YCENVADL PEMLLNVLQD GD I VLNMGAG S I NRVPAALL 
501 ALSKQI * 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from 7V. gonorrhoeae 

ORF 092 shows 96.6% identity over a 506 aa overlap with a predicted ORF (ORF 092.ng) 
from N. gonorrhoeae: 

m092/g092 

10 20 30 40 50 60 

m092 .pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 
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I i M I IIIMIIIII MM IMIIII M MMMMMMMMMMMMMM 

g092 MFFISIRYIFVRKLWCANGQTFKITPLRTKNQPERNIMMKNRVSNIHFVGIGGVGMSGIA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m092 . pep E VLHNLGFKVSGSDQARNAATEHLGSIjG I QV Y PGHTAEHVNGADVWTS TAVKKENPEW 

i I I I II I I MM II Ml I I I i llhl I Ml I I I I I! I I I I 111 I I 1^ M II III I I I M 

g092 EVLHNLGFKVSGSDQARNAATEHLSSLGIQVYPGHTAEHVNGADVWASTAVKICENPEW 

70 80 90 100 110 120 

130 140 150 160 170 180 

m092 . pep AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

M MM 1 1 II M MM M MM IM MMM MM Ml 1 1 1 MM M I M I MMMMM 

g092 AALERQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

130 140 150 160 170 180 

190 200 210 220 230 240 

n\092 . pep NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 

M ! MM || MMI I II II II Ml II M MMM Ml I II IMMMM M I MM II I I 

g 0 9 2 NAAGTNARLGKGE Y I VAE ADES DAS FLHLTP I MSWTN I DEDHMDT YGHS VEKLHQAF I D 

190 200 210 220 230 240 

250 260 270 280 290 300 

mO 92 . pep F I HRMP F YGKAFLC I DS EHVRAI L PKVS KP YATYGLDDTAD I YATD I ENVGAQMKFTVHV 

MM MMM MMM MMIM MMM MMM MMM MMM M MMMMM I 

g092 F I HRMPFYGKAFIjCVDS EHVRAI L PKVS KP YATYGLDDTAD I YATD I ENVGAQMKFTVHV 

250 260 270 280 290 300 



310 320 330 340 350 360 

mO 92 . pep QMKGHEQGSFEWLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

I ! M I I M I Ml MM M I I I MMMI MM I I MM I I Ml I I MM I I MM III M 

g092 QMKGHEQGSFEWLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 



370 380 390 400 410 420 

mO 9 2 . pep DI KXjPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 

I I I MM IMIIMIMM MMMIMIIMM MMMMMMMIMMMMM 

gO 9 2 D I KLPNGGTALLVDDYGHHPVEMAATLAAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 

370 380 390 400 410 420 



430 440 450 460 470 480 

m092 . pep VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLiNVLQD 

I I I I M M 1 1 1 1 1 1 1 M M I M 1 1 1 1 1 M M I M 1 1 II M M 1 1 1 1 1 M I M M M M M 

g 0 9 2 VLNTVDALVLTEVYAAGEE PVAAADS RALARAI RVLGKLE P I YCENVADLPQMLMNVLQD 

430 440 450 460 470 480 



490 500 
m092 . pep GD I VLNMGAGS I NRVPAALLALS KQ I X 

IMMIMIMMMMMI MIMI 

g 0 9 2 GD WLNMGAGS I NRVPS ALLELS KQIX 

490 500 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 313>: 

a092 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGT C AG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AAT CGAGTGA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 TTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 
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301 AACGGTGCGG ATGTCGTCGT 

351 CGAAGTTGTC GCTGCGTTGG 

4 01 TGATGTTGGC GGAGTTGATG 

4 51 ACGCACGGCA AAACCACGAC 

501 GGCAGGACTT GACCCGACTT 

551 GCACCAACGC CCGCTTGGGC 

601 GAGTCGGATG CATCCTTTCT 

651 CAATATCGAC GAAGACCATA 

701 TGCATCAGGC GTTTATCGAT 

7 51 GCCTTTTTGT GTATTGACAG 

801 GAGCAAACCT TATGCTACTT 

851 CCACCGACAT CGAAAACGTC 

901 CAAATGAAAG GACATGAGCA 

951 CGGC AG AC AC AACGTGCTGA 

1001 AAGTCGGCGC ATCGGTTGAA 

1051 GGTGTCGGCC GCCGCTTCCA 

1101 TGGAACCGCG CTCTTGGTGG 

1151 CGGCGACCCT TTCCGCCGCA 

1201 CTCGCCTTCC AGCCGCACCG 

1251 CTTTACCAAA GTCCTCAATA 

1301 ATGCCGCCGG TGAAGAGCCG 

1351 CGCGCCATCC GCGTGTTGGG 

14 01 TGCCGATCTG CCCGAAATGC 

14 51 TGTTGAAT AT GGGTGCGGGA 

1501 GAATTGTCGA AACAGATTTG 



TACCTCTACC GCCGTCAAAA AAGAAAATCC 

AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 

CGCTTCCGTG ACGGCATCGC CATTGCCGGC 

CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

TCGTTATCGG CGGCAAACTC AACGCCGCAG 

AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

GCACCTGACA CCGATTATGT CCGTCGTTAC 

TGGATACCTA CGGGCACAGT GTTGAGAAGC 

TTCATCCACC GTATGCCCTT CTACGGCAAA 

CGAACACGTC CGCGCGATTT TGCCCAAAGT 

ACGGTTTGGA CGATACCGCC GACATCTACG 

GGCGCGCAAA TGAAATTCAC CGTCCATGTT 

GGGGTCGTTT GAAGTCGTGC TGAATATGCC 

ACGCATTGGC AG C CAT CGGC GTGGCGCTGG 

GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 

AAAATACGGC GACATCAAGT TGCCAAACGG 

ACGACTACGG ACACCACCCC GTCGAAATGG 

CGCGGCGCGT ATCCGGAAAA ACGTTTGGTA 

CTATACCCGC ACGCGCGATT TGTTTGAAGA 

CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 

ATTGCCGCCG CTGATTCCCG CGCTCTTGCC 

CAAACTCGAG CCGATTTACT GCGAAAACGT 

TGTTGAACGT TTTGCAGGAC GGCGACATCG 

AG CAT C AACC GCGTCCCCGC CGCGCTGCTG 
A 



This corresponds to the amino acid sequence <SEQ ID 314; ORF 092.a>: 

a092 .pep 

MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 
IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 
101 NGADVWTST AVKKENPEW AALEQQIPVI PRALMLAELM RFRDGIAIAG 
151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 
201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 
251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 
301 QMKGHEQGSF EWLNMPGRH NVLNA LAAIG VALEVGASVE AI QKGLLGFE 
GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLSAA RGAYPEKRLV 
LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA 
RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL 
ELSKQI* 



1 

51 



351 
401 
451 
501 



m092/a092 99.4% identity over a 506 aa overlap 



10 20 30 40 50 60 

m0 92 .pep MFFI S I RYI FVRKLWRANGQPFKITPLRI EN PPERNIMMKNRVTN I HFVG IGGVGMSGIA 

I I I I I I M I I I I I I I II II I I I I I I I II I I II I I I I I I I t I I I I I I I I M I I I I II I ! II 
a092 MFFISIRYIHVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m092 .pep EVLHNLGFPCVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADWVTSTAVKKENPEW 
I I I I I I I I II I I II I I I ! I I I I I I I II I I I I I I I I I I ! I I I M I I I I I I I M I I If I I I I 
a0 92 EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADWVTSTAVKKENPEVV 

70 80 90 100 110 120 



130 140 150 160 170 180 

m092,pep AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 
I I I I I M I I I I I I I II I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a0 92 AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

130 140 150 160 170 180 



190 200 210 220 230 240 

mO 9 2 . pep N AAGTNARLGKGE YI VAEADES D AS FLHLT P IMS VVTN I DEDHMDT YGHS VEKLHQAFI D 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I M M I I I I I 
a092 NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 
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190 200 210 220 230 240 

250 260 270 280 290 300 

m092 . pep FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV 
I I I M I I I I I I I t I II M I I I M t i I I I I i I I I I I M I ) ! I I If | | M I I I I I M I I I I I 
a092 FIHRMPFYGKAFLCIDSEHVRAILPKVSKPYATYGLDDTADIYATDIENVGAQMKFTVHV 

250 260 270 280 290 300 

310 320 330 340 350 360 

m092 . pep QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 
I M M I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I i I I I I I II M I I I I I 
a092 QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 

370 380 390 400 410 420 

m0 92 . pep DIKLPNGGTALLVDDYGHH PVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 

I M I I II I II I I I I I I I I I M I I I I I I : I I M I I I I I I I I I I M I || I I | I | | | | | | | I 
a092 DIKLPNGGTALLVDDYGHH PVEMAATLSAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 

370 380 390 400 410 420 

430 440 450 460 470 480 

m092 . pep VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD 

I I M I I I I I I I I I I M t I I I I I I I 11 I I I I I I I I I I I I | | | i I I I I I I II I I I | 

a092 VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD 

430 440 450 460 470 480 

490 500 
m092 . pep GDIVLNMGAGSINRVPAALLALSKQIX 
I I I I I I I I I I I II II I I I I I I I II I I 
a092 GDIVLNMGAGS INRVPAALLELSKQIX 

490 500 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 315>: 

g093 . seq 

1 a TGCAGAAT t ttgGCAAAGT ggccgtATTG ATGGGtggtT TTTCCAGCGA 

51 ACGAGAa ate tcgcTGGACA GCgGTACCGC CATTTTGAAC GCCTTAAAAA 

101 G C AAAGG CAT AGACGCATAC GCCTTCGACC CTAAGGAAAC GCCGTTATCC 

151 GAACTGAAGG AGCGGGGCTT T CAGACGG C A TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCCTCCG CCATCGG CAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTACCCGTTC CCGAGTT CGC 

3 51 CGTACTGTAC GATGATACCG ATTTCGATGC CGTCGAAGAA AAATTGGGTC 

4 01 TGCCGATGTT TGTGAAGCCG GCGGCCGAAG GCAGCAGCgt cggcgtggta 
4 51 aAAGTCAAAG AAAaaggccg TCTGAAAAGC GTTtacgaag aatTGAaaCA 
501 CCTTcagggg cgaAAtcatt gccgAacgTT TTATCGG CGG CGGCGAATAT 
551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATCCC 
601 CGCAACCGAG TTTTACGAct aegaagecaa GtacaaCCGA GACGAcacca 
651 tttaTCAATG T C CT T CGG AA GATTTGACCG AAGCCGAAGA AAGCCTGATG 
701 CGCGAACTGG CGGTTCGCGG CGCACAGGCA AT CGGTGCGG AAGGCTGCGT 
751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 
8 01 TCAACACCCT GCCCGGTATG AC CGG C CAT A G 

This corresponds to the amino acid sequence <SEQ ID 316; ORF 093.ng>: 
g093 .pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKERGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLY DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKEKGRLKS VYEELKHLQG RNHCRTFYRR RRIFLPRPER QRAARHTHHP 

2 01 RNRVLRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RTGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 31 7>: 

m093.seq 
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1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CTAAAGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCrAA GACGGGGCGG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

3 01 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

3 51 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

4 01 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 
4 51 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 
501 CCTTCAGGG. CGAAATCATT GCCGAACGTT TTATCGGCGG CGGCGAATAT 
551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATTCC 
601 CGCAACCGAG TTTTACGACT ACGAAGCCAA GTACAACCGC GACGACACCA 
651 TTTATCAATG TCCTTCGGAA GATTTGACCG AAGCCGAAGA AAGCCTGATG 
701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 
751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 
801 TCAACACCCT GCCCGGTATG ACGAGCCATA G 



This corresponds to the amino acid sequence <SEQ ID 31 8; ORF 093>: 

m093 . pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 



51 ELKAQGFQTA FNILHGTYGX DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKGKGRLKS VYEELKHLQX RNHCRTFYRR RRIFLPRPER QRAARHTHHS 

2 01 RNRVLRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDEP* 



Computer analysis of this amino acid sequence gave the following results : 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 093 shows 96.7% identity over a 276 aa overlap with a predicted ORF (ORF 093 .ng) 
from N. gonorrhoeae: 

m093/g093 



m093 .pep 



10 20 30 40 50 60 

MQN FG KVAVLMGG FS S E RE I S LDS G TAI LNALKS KG I DA YAFD P KE T P LS E L KAQ G FQTA 



g093 




10 20 30 40 50 60 



m093 .pep 



70 80 90 100 110 120 

FNI LHGTYGXDGAVQGALELLGI P YTG S GVAAS AI GMD KYR C KL I WQALGL P VP E FAVLH 



9093 




70 80 90 100 110 120 



m0 93 . pep 



130 140 150 160 170 180 

DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 



g093 




130 140 150 160 170 180 



m093 -pep 



190 200 210 220 230 240 

RRI FLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMS FGRFDRS RRKPD ARTGGS R 



g093 




190 200 210 220 230 240 



m093 -pep 



250 260 270 

RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX 



g093 




250 260 270 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 319>: 

a093 . seq 

1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CCAAGGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

4 01 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 

4 51 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 

501 CTTTCAGGG . CGAAATCATT GCCGAACGGT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCTG TGTTGAACGG CAAAGGCCTG CCCGGCATAC ACATCATCCC 

601 CGCGACCGAG TTTTATGACT ACGAAGCCAA GTACAACCGC AACGACACCA 

651 TTTATCAATG TCCTTCGGAA GATCTGACCG AAGCCGAAGA AAGCCTGATG 

7 01 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 320; ORF 093.a>: 

a093 -pep 

1 MQN FGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDFKETPLS 

51 ELKAQGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGVV 

151 KVKGKGRLKS VYEELKHFQX RNHCRTVYRR RRIFLPCVER QRPARHTHHP 

201 RDRVL*LRSQ VQPQRHHLSM SFGRSDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 

m093/a093 95.7% identity over a 276 aa overlap 

10 20 30 40 50 60 

MQN FGKVAVLMGG FS SERE I S LDSGTAI LNALKSKGI DAYAFDPKET PLSELKAQGFQTA 

I | | M I I I I I I I I I I I M II II I I I I I i t I I I II I I I I I I I i I M M M I I I I M t i I M 
MQN FGKVAVLMGGFS SERE IS LDSGTAI LNALKSKGI DAYAFDPKET PLSELKAQGFQTA 
10 20 30 40 50 60 

70 80 90 100 110 120 

FN I LHGTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALG LPVPEFAVLH 

t M I I II I I i I I I M I I I I I I I I I I M I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I 
FNILHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 
70 80 90 100 110 120 



m093 .pep 
a093 

m093 . pep 
a093 



130 140 150 160 170 180 

m0 93 pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 

| | | | | | | | | | | I I I I I I I I I I I I M I I I I M I I I 1 I I I I M I I I I I I : 1 M I II 

a093 DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHFQXRNHCRTVYRR 

130 140 150 160 170 180 

190 200 210 220 230 240 

m0 93 pep RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 

| ! 1 | | | | I IE i : I I II I: II I I I I I I I I I M I M I I I I I I I I I 

a0 93 RRI FLPCVERQRPARHTHHPRDRVLXLRSQVQPQRHHLSMSFGRSDRSRRKPDARTGGSR 

190 200 210 220 230 240 



250 260 270 

mO 93 . pep RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX 
M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 11 : I I 
a0 93 RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX 

250 260 270 
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The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 32 1>: 

g094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTGG GGTCGAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTAccggc aatggcgatg 

151 cCGTCACGGA AGCGCATCAG CTCTGCCAGC ATCAAGGCGC GCGGAATAAC 

201 GGGAATTTGC CGCTCCAACG CAgcgacaAC TTCGGgattT TCTTTCTTGA 

251 CGGCGGTAGA GGCAACGACG ACATccgcAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 322; ORF 094.ng>: 

g094 -pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMAM 
51 PSRKRISSAS IKARGITGIC ' RSNAATTSGF SFLTAVEATT TSAPLTCSAV 
101 WPG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 323>; 

m094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGT T A GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AG CGCAT C AA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCGGCGGTA 

3 01 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 324; ORF 094>: 

m094 . pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMA M 
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 094 shows 95.1% identity over a 103 aa overlap with a predicted ORF (ORF 094.ng) 
from N. gonorrhoeae: 

m094/g094 



m094 . pep 
g094 



m0 94 .pep 
g094 

70 80 90 100 



10 20 30 40 50 60 

MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRINSAN 

1 1 1 1 1 1 1 i 1 1 1 1 i i t f i I f 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 f 1 1 i I f i 1 1 i ! I i 1 1 1 = 1 1 = 

MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRISSAS 
10 20 30 40 50 60 

70 80 90 100 

I RARGI TGI CCSNAATTSGFS FLTAVEVTTTS APLTCS AVWPGX 

MINIMI IMIIIIIIIIIIIIhlMIMIMIIIIIII 

I KARG I TGI CRSNAATTSGFS FLTAVEATTTS APLTCS AVWPGX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 325>: 

a094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 326; ORF 094.a>: 
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a094 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVVV VLPCVPAMA M 
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

m094/a094 100.0% identity over a 103 aa overlap 



10 20 30 40 50 60 

m094 .pep MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRINSAN 
I I I I I I I M I I I II I ! If I I I I II I I I f I I I M I I I I II I M I I I I I I I I M I 1 I II I I I 
a094 MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAMPSRKRINSAN 

10 20 30 40 50 60 



70 80 90 100 

mO 9 4 . pep I RARGITG I CCSNAATT SG FS FLTAVE VTTTS APLTCSAVW PGX 

I I I I I I I I I I ! I I M I M I I I I I I I I I I I I I |.| I I | | | | | M I I 
a094 IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 327>: 

g095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AACACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TAT TGGTAAC GACGGACATA AT CGGGGTCA GTGTAGGAAA 

3 01 GAGGCATCGG ATCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

3 51 CGGGCGTTGG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 328; ORF 095 .ng>: 

g095 .pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGG I VQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDE I DK RL MQFFDAVPVG IHMVFV DIGN DGHNRGQCRK 

101 EASDRRLRQR CIRLCPSGRW CLRR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 329>: 

m095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

3 01 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTAG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 330; ORF 095>: 

m095 .pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGG I VQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDE I DKR L MQFFDAVPVG IHMVFV DIGN DGHNRCQCRK 

101 DAS DRRLRQR CIRLCPSGRX CLRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 095 shows 97.6% identity over a 124 aa overlap with a predicted ORF (ORF 095. ng) 
from N. gonorrhoeae: 

m095/g095 



10 20 30 40 50 60 

m095 . pep MS FHLNMDGE FHLRADVFDVGGVDVGGI VQTVSS I RFAHFGQNRADVFAVNTQKGFAVEG 

Mill li II II IIMIMI Ml I I lilllllil II I II I IN || MMIIMlh || I I 
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g095 MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m095 . pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

I III I II MINIM II III III I MMM I ll; I I I II : I I i I I I II I I I I I I I II I 

g095 HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRGQCRKEASDRRLRQRCIRLCPSGRW 

70 80 90 100 110 120 



m095.pep CLRRX 

Mill 

gO 9 5 CLRRX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33 1>: 

a095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGCTTC TCAACACTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

301 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTGG TGCCTGCGGC GTTGA 



This corresponds to the amino acid sequence <SEQ ID 332; ORF 095.a>: 

a095.pep 

1 MS FHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDKRL MQLLNTVPVG IHMVFVDIGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRW CLRR* 

m095/a095 96.0% identity in 124 aa overlap 

10 20 30 40 50 60 

m0 95 . pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 
I M I I I I I I I I 1 I I I I I I I I I I I I I I I M I I I M I I I I I I M I I II I I I I II I) I I I I I I 
a095 MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m095.pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

I II I 1 I I I M M :::: I I I I II I II I I I I I I I I I I I I I I I I I j I I I I I I I I I M I I I I I 
a095 HTVDEIDKRLMQLLNTVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRW 

70 80 90 100 110 120 

m0 95.pep CLRRX 
IMII 

a095 CLRRX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 

333>: 

g096 . seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGTGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAAATCT TCCGAAGGAC ATTGAtaaat ggtgTCGTCT CGGttgtaCt 
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201 tggcttcgta gTCGTAAAAC TCGGTTGCGG GGATGATGTG TATGCCGGGC 

2 51 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC CGCCGATAAA 
301 AcgtTcggca atgaTTtcgc ccctgAAGGT GttTCAattc ttcgtaAACG 

3 51 CTTTTCAGAc ggccttTTTC TTTGA 



This corresponds to the amino acid sequence <SEQ ID 334; ORF 096.ng>: 

g096 .pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLCAANR QFAHQAFFGF 

51 GQIFRRT LIN GVVSVVLGFV VVKL GCGDDV YAGQPFAVQD GAG I FAAADK 

101 TFGNDFAPEG VSILRKRFSD GLFL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 335>: 

m096.seq 

1 ATGGCTCGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CG C AC CG ATT 

101 GCCTGCGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAAATCT TCCGAAGGAC ATTGATAAAT GGTGTCGTCG CGGTTGTACT 

2 01 TGGCTTCGTA GTCGTAAAAC TCGGTTGCGG GAATGATGTG TATGCCGGGC 

251 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 ACGTTCGGCA ATGATTTCGC CC . TGAAGGT GTTTCAATTC TTCGTAAACG 

351 CTTTTCAGAC GGCCTTTTCC TTTGA 



This corresponds to the amino acid sequence <SEQ ID 336; ORF 096>: 

mO 9 6 . pep 

1 MARHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF 

51 GQIFRRT LIN GVVAVVLGFV WKL GCGNDV YAGQPFAVQD GAG I FAAADK 

101 TFGNDFAXEG VSILRKRFSD GLFL* 



m096/g096 96.0% identity in 124 aa overlap 



10 20 30 40 50 60 

m096.pep MARHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFGFGQIFRRTLIN 
II M I I I I I I I I M I M I I I I I I I I I I M I I I I I I I M I ! I I I I I I I I 1 | | | 1 | M n 
g09 6 MAGHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLCAANRQFAHQAFFGFGQIFRRTLIN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m096 . pep GV VA WLG FVVVKLGCGN DVYAGQ P FAVQDG AG I FAAADK T FGND FAXEGVS I LRKRFS D 

I I I • M I I I ! E I I I I I I : I M I I I I M MINIMUM M M M M M M 

5096 GWSWLG FVWKLGCG DDVYAGQPFAVQDGAGI FAAADKTFGNDFAPEGVS I LRKRFS D 

70 80 90 100 110 120 



m096. pep 
g096 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33 7>: 

a096.seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGCGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAGATCT TCCGAAGGAC ATTGATAAAT GGTGTCGTTG CGGTTGTACT 

201 TGGCTTCGTA GTCATAAAAC TCGGTCGCGG GGATGATGTG TATGCCGGGC 

251 AGGCCTTTGC CGTTCAACAC AGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 CCGTTCGGCA ATGATTTCGC CCT . GAAAGT GTTTCAATTC TTCGTAAACG 

351 CTTTTCAGAC GGCCTTTTCC TTTGA 

This corresponds to the amino acid sequence <SEQ ID 338; ORF 096.ng>: 
a 0 96 . pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF 



GLFLX 
I M M 
GLFLX 
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51 GQIFRRT LIN GVVAVVLGFV VIKL GRGDDV YAGQAFAVQH RAGIFAAADK 
101 PFGND FAXES VSILRKRFSD GLFL* 



m096/a096 92.7% identity in 124 aa overlap 

10 20 30 40 50 60 

m096 . pep MARHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFG FGQIFRRTLIN 

II I I I I II I I I I I M I I I I I II I I I I I II I I I! I I I I I I I I I I I I I I I I I I I I i ! I M I 
a096 MAGHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFG FGQI FRRTLIN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m096 . pep GVVAVVLGFVWKLGCGNDVYAGQPFAVQDGAGIFAAADKTFGNDFAXEGVSILRKRFSD 
I I I I I I I I I I I : I M I : I I I II I I I I I I I I I I 11 I I I II I II I I : M M I I I I I t 
a096 GVVAVVLGFVV I KLGRGDDVYAGQAFAVQHRAG I FAAADKPFGND FAXES VSILRKRFSD 

70 80 90 100 110 120 



m09 6.pep GLFLX 
I J I I I 

a09 6 GLFLX 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 339>: 

g097 . seq 

1 ATGGATATTT CAAAACAAAC ATTGCTGGAT AGGGTTTTTA ACCTGAAGGC 

51 AAACGGTACG ACGGTACGTA CCGAGT TGAT GGCGGGTTTG ACGACCTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAATC CCCTGATTTT GGGCGAGACC 

151 GGAATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CATCCGCCAT 

201 CGGCTGTTTT GTCATGGGTT TTATCGGCAA CTATCCGATT GCGCTTGCCC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

3 01 GGCGTGCCTT GGCAGGTGGC GTTGGGTGCG GTGTTCATTT CCGGTCTGAT 
351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCCG CCGGTATCGG TTTGTTTTTG 
4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 
501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC CAGCGCACTG TTGGCATTGT 
551 TCGGTTTTGT CATGGTGGTC GTATTGGGGT ATTTCCGCGT TCAAGGCG CA 
601 ATCATCATCA CCATTCTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 
651 GAACGAGTTT CACGGCGTGG TCGGCGAAGT ACCGGGCATT GCGCCGACCT 
701 TTATGCAGAT GGATTTTAAA GGTCTGTTTA CCGT CAG CAT GGTCAGCGTG 
751 ATTTTCGTCT TCTTCTTGGT CGATTTGTTC GACAGTACCG GAACGCTGGT 
801 CGGCGTATCC CACCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 
851 TGAAACGCGC ACTG CTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 
901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 
951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TGGCGTGTCT GATGTTCTCC CCATTGGCGA AAAGTGTTCC GGTATTTGCC 

1051 ACCGCGCCCG CACTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATTGAT TGGGACGATA TGACTGAAGC CGCGCCCGCG TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC 

1201 TTCATCAGCT ATGCCGTGGT CAAACTTTTG TGTCGCCGGA CTGGGGACGT 

1251 GCCGCCTATG GTATGGGTTG TTGCCGTATT GTGGGCATTG AAATTCTGGT 

13 01 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 340; ORF 097.ng>: 

g097 .pep 

1 MDISKQTLLD RVFNLKANGT TVRTE LMAGL TTFLTMCYIV IVN PLILGET 

51 GMDMG AV FVA TCIASAIGCF VM GFIGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAG IGLFL 

151 ALISLKGAGI I V AN P ATLVG LGDIHQPS AL LALFGFVMW VLGYF RVQGA 

201 IIITILTITV IASLMGL NEF HGWGEVPGI APTFMQMDFK GLFTVSMVSV 

251 IFVFFLV DLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA D S T A I V AG AA 

3 01 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPVFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 
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4 01 FISYAWKLL CRRTGDVPPM VWWAVLWAL KFWYLG * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 341 >: 

m097 . seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACk ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCyGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

3 51 TTTTATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 
451 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 
501 CTTGGTCGGT TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCATTGT 
551 TCGGTTTTGC TATGGTGGTC GTATTGG G AC ATTTCCGCGT TCAAGGCGCA 
601 AT CATCATC A C C ATCTTG AC CATTACCGTC ATTGCCAGCC TGATGGGTTT 
651 GAATGAATTT CACGG CAT C A TCGGCGAAGT ACCGAGCATT GCGCCGACTT 
701 TTATGCAGAT GGATTTTGAA GGCCTGTTTA CCGTCAGCAT GGTCAGTGTG 
751 ATTTTCGTCT TCTTCTTGGT CGATCTATTT GACAGTACCG GAACGCTGGT 
801 CGGCATATCC CACCGTGCCG GGCTGCTGGT GGACGGTAAG CTGCCCCGCC 
851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 
901 TTGGGTACTT CTTCCACCAC GCCTTATGTG GAAAGCG CGG CGGGCGTATC 
951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC G CAGTGCG AG 

1101 GGATATTGAT TGGGACGATA TGACGGAAGC CGCACCTGCG TTCCTGACCA 

1151 TTGTTTTCAT GCCGTTTACT TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

1251 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 

1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 342; ORF 097>: 

m097 .pep 

1 MDTSKQTLLD GIFKLKANGT TVRTE LMAGL TTFLTMCYIV IVN PXILGET 

51 GMDM GAVFVA TCIASAIGCF VM GFVGNY P I ALAPGMGLNA Y FT FAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAG IGLFL 

151 ALISLKGAGI IVA NPATLVG LGDIHQPS AL LALFGFAMW VLGHF RVQGA 

2 01 IIITILTITV I AS LMGLN E F HGIIGEVPSI APTFMQMDFE GLFTVSMVSV 
251 IFVFFLV DLF DSTGTLVGIS HRAG LLVDG K LPRLKRALLA DSTAIVAGAA 

3 01 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 
351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

4 01 FISYAWKLL CRRTKDVPPM VWIVAVLWAL KFWYLG * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 097 shows 96.3% identity over a 436 aa overlap with a predicted ORF (ORF 097.ng) 
from N. gonorrhoeae: 

m097/g097 

10 20 30 40 50 60 

m097 . pep MDTSKQTLLDGI FKLKANGTTVRTELMAGLTTFLTMCYI VIVNPXI LGETGMDMGAVFVA 

|| | | M | I I MMMIIMIIIIIMMMIMIIIIIIIII IIIIIMIIIIIMI 
g0 97 MDISKQTLLDRVFNLKANGTTVRTELMAGLTTFLTMCYIVIVNPLILGETGMDMGAVFVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m0 97 . pep TCI ASAI GCFVMGFVGNYP I ALAPGMGLNAYFTFAWKGMGVPWQVALGAVF I SGLI FI L 

II Ml Mil MM hi II IMIII III I I II II III II INI M I I I I I II M IIMM! 

g0 97 TCI AS AI GCFVMGF I GN Y P I ALAPGMGLNAY FTFAWKGMGVPWQVALGAVF I S GL I F I L 

70 80 90 100 110 120 

130 140 150 160 170 180 
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m0 97 -pep 
g097 



m0 9 7 .pep 
g097 



m097 .pep 
g097 



m0 97 . pep 
g097 



m097 . pep 
g097 



m097 . pep 
g097 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 343> 

a097 . seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACG ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCTGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 

4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 

501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCACTGT 

551 TCGGTTTTGC CATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 

601 AT CAT CAT C A CCATTTTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 

651 GAACGAATTT CACGGCATCA TCGGCGAAGT GCCGAGCATT GCGCCGACTT 

7 01 TTATGCAGAT GGATTTTAAA GGGTTGTTTA CCGTCAGCAT GGTCAGCGTG 

7 51 ATTTTCGTCT TTTTCCTAGT CGATCTGTTC GACAGTACCG GAACACTGGT 

801 CGGTGTATCG CATCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 

851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CTATTGTGGC AGGTGCGGCT 

901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGTGCGG CGGGCGTATC 

951 GGCAGGCGGG CGGACAGGTC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GG AC AT CG AT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

1251 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 



FS F FKVREMLVNALPMGLKMS I AAG I GLFLALI SLKGAG 1 1 VANPATLVGLGD I HQPS AL 

Ml II II III M MM MM MIMIIII MMIM II MM III Mill Mill I I III 

FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 
130 140 150 160 170 180 

190 200 210 220 230 240 

LALFGFAMVWLGHFRVQGAIIITILTITVIASLMGLNEFHGI IGEVPSIAPTFMQMDFE 

imi I Ml M I MM ! I M Mi I I MM II III I I M IM MMIM M MM I I I I M 

LALFGFVMVWLGYFRVQGAI I ITILTITVIASLMGLNEFHGWGEVPGIAPTFMQMDFK 
190 200 210 220 230 240 

250 260 270 280 290 300 

GLFTVSMVSVI FVFFLVDLFDSTGTLVGI SHRAGLLVDGKLPRLKRALLADSTAI VAGAA 

M I 1 1 1 1 1 1 1 1 M 1 1 E I ! 1 1 1 1 1 1 M 1 1 = 1 1 M E 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GLFTVSMVSVIFVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAIVAGAA 
250 260 270 280 290 300 

310 320 330 340 350 360 

LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 

1 1 1 1 1 ! I 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M ^ 1 1 1 I 1 1 1 1 1 1 1 1 

LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFS PLAKSVPVFATAPALLYVGT 
310 320 330 340 350 360 

370 380 390 400 410 420 

QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 

1 1 1 1 1 1 1 1 1 1 1 M ! 1 1 1 1 1 11 1 1 II I M 1 M I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II ! I Mill 

QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTGDVPPM 
370 380 390 400 410 420 

430 

VW I VAVLWALKFW YLGX 

I I : I I I I I I I I I I J I I I 
VWWAV LW ALK FW YLGX 
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1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 344; ORF 097. a>: 

a097 .pep 

1 MDTSKQTLLD GIFKLKANGT TVRTE LMAGL TTFLTMCYIV IV NPLILGET 

51 GMDMG AVFVA TCIASAIGCF VM GFVGNYPI ALAPGMGLNA YFT FAVVKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAGIGLFL 

151 ALISLKGAGI IV ANPATLVG LGDIHQPS AL LALFGFAMVV VLGHF RVQGA 

201 IIITILTITV IASLMGL NEF HGIIGEVPSI APTFMQMDFK GLFTVSMVSV 

251 IFVFFLV DLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAXVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

4 01 FISYAVVKLL CRRTKDVPPM VWIVAVLWAL KFWYLG* 



m097/a097 99.3% identity in 436 aa overlap 

10 20 30 40 50 60 

m097 . pep MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPXILGETGMDMGAVFVA 
I I I I I I I I I I M I i I M I I I I I I I I I I I I I M I I I I 1 I I f II M I I I 1 M | | | i | | | | | 
a097 MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPLILGETGMDMGAVFVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m097 . pep TCIASAIGCFVMGFVGNYPIALAPGMGLNAYFTFAWKGMGVPWQVALGAVFISGLIFIL 
I I I I I II I M I I I I II I I I I I 1 I I I II I I I I I I I II M I I I I I I I I | | | | | M | | | | | | | 
a0 97 TCI AS AI GC FVMGFVGN Y P I ALAPGMG LN A Y FT FAVVKGMG V PWQV ALGAV FISGLIFIL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m0 97 .pep FS F FK VREML VN AL PMG LKM S I AAG I GL FLAL I S LKGAG 1 1 VAN PAT L VG LG D I H QP SAL 

I I M I I I I I I I I IE I I I II I I I I I I II I I f | | | | | | | | | | | M I I I I I I I I I if 

a097 FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m097 . pep LALFGFAMVVVLGHFRVQGAIIITILTITVIASLMGLNE FHGI IGEVPSIAPTFMQMDFE 

I I I I I I i I I I I I I I I I M I I I I II I I I I I I I If I I I I I I I 1 I I I 1 I | | | | | I I II I I I I : 
a097 LALFGFAMVWLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m097 . pep GLFTVSMVSVIFVFFLVDLFDSTGTLVGISHRAGLLVDGKLPRLKRALLADSTAIVAGAA 
I I M I I I I I! I I M M I I M I I I I I I I I : I I M I II I I 1 I I I 1 I M I I I I I I I I M I I i I 
a0 97 GLFTVSMVSV I FVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAIVAGAA 

250 260 270 280 290 300 

310 320 330 340 350 360 

m097 . pep LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 
I I I I I M I I I I I I I I I I I I I II II I I I I I M I I I I I II I I I I I I E I I I I I I I I i I f I M I 
a097 LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m097 . pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 
I I I I I I I I I I I I I I I I I I I I I I I I II I II I I II II I I If I I I I I I I M I II I I I I I I I M 
a097 QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAVVKLLCRRTKDVPPM 

370 380 390 400 410 420 

430 

mO 97 . pep VWIVAVLWALKFWYLGX 
I I II I I M I I I I I M I I 
a0 9 7 VWIVAVLWALKFWYLGX 

430 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 345>: 

g098 . seq 

1 ATGACCGCCG ACGGTCTCTT CGTCGCTTTC AACTTCAATA CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAGCAGGA TGCTGCCCAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCGCTG TCGGGATGGC TTTCGCGTTC 

151 GGGATGAACG CAGCCGAGCA TGGACACGCC GGTACGCATC ACGTCCATCG 

2 01 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

2 51 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

3 01 GTTGGGCAGA TGGCCGTGAA TCAGCAAGTG TGCGACTTCT TCAAACTCGC 
3 51 ATTTTTGTGC CAAATTAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 346; ORF 098.ng>: 

g098.pep 

1 MTADGLFVAF NFNTFAWR I L1PVQQDAAQ AGDQ FVGDVA RFAVGMAFAF 

51 GMNAAEHGHA GTHHVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV CDFFKLAFLC QIRMS* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 347>: 

m098 . seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

3 51 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 348; ORF 098>: 

m098 . pep 

1 MTADGLFVAF NLNAFAWRI LIPVQEDAAE AGDQ FVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 098 shows 89.6% identity over a 125 aa overlap with a predicted ORF (ORF 098.ng) 
from N. gonorrhoeae: 

m098/g098 

10 20 30 40 50 60 

m098 -pep 
g098 



m098 .pep 
g098 



m098 .pep 
g098 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 349>: 

a098 .seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 
51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 



MTADGL FVAFNLNAFAWR I L I PVQEDAAEAGDQFVGDVARFT FRMAFT FRMNAAQHGYA 

IIIIIIIMIhhllllllllllhllhMIIMIIMIh llhl lllhlhl 

MTADGLFVAFNFNTFAWRI L I PVQQDAAQAGDQFVGDVARFAVGMAFAFGMNAAEHGHA 
10 20 30 40 50 60 

70 80 90 100 110 120 

GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

llhlllMIIIIIIMIIIIIIIIIMIIIIIIMIIMIIIIIIIIII lllllllli 

GTHHVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVCDFFKLAFLC 
70 80 90 100 110 120 



QIRMSX 

I I I I II 
QIRMSX 
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101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

3 51 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 350; ORF 098.a>: 

a098 -pep 

1 MTADGLFVAF NLNAFA WRI LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 



m098/a098 100.0% identity in 125 aa overlap 

10 20 30 40 50 60 

m0 98 . pep MTADGLFVAFNLNAFAVVR I LIPVQEDAAE AG DQFVGDVARFTFRMAFTFRMNAAQHGY A 

I I I I M I I I M I I I I I I I I I t I I M I M I I I I I I I I I I M I t I M I I I I I I i M I I II M 
a098 MTADGLFVAFNLNAFAWRILIPVQEDAAEAGDQFVGDVARFTFRMAFT FRMNAAQHGYA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m0 98 . pep GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 
I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I M M I I I I M I I M M I I I I 
a0 98 GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

70 80 90 100 110 120 



m098.pep QIRMSX 
Mill! 

a098 QIRMSX 



The following partial DNA sequence 

g099 . seq 



1 


ATGCTGGGAC 


GCGCGTCCAT 


51 


GCTGACGGGC 


AAACGGCAGG 


101 


CACTGACCGA 


ATT CTTGCGT 


151 


TTTTTCGGCG 


AGGGCGCGAG 


201 


TTCCAACATG 


ACGCCGGAGT 


251 


ACGCGCAAAC 


TATTGATTAT 


301 


GTGAAATTGG 


TGGAAACCTA 


351 


CTTGAAAACC 


GCCGTTTATC 


401 


TAACGCGCAA 


TATGGCAGGC 


451 


GCCGATTTGG 


CGGCGAAAGG 


501 


CGGCCAAATG 


CCTGACGGTG 


551 


CCAATACTTC 


CAACCCGCGC 


601 


AATGCCAACC 


GCCTCGGCTT 


651 


TGCCCCGGGT 


TCAAAAGTAG 


701 


TGCCCGAAAT 


GGAAAAACTC 


751 


ACCTGTAACG 


GCATGAgcgG 


801 


CATCGACCGC 


GAtttgtacg 


851 


TCGACGGCCG 


TATCCATCCG 


901 


CCTTTGGTCG 


TTGCCTACGC 


951 


AAACGACGTA 


CTCGGCGTTG 


1001 


TCTGGCCGAC 


AGACGAAGAA 


1051 


CCGCAACAAT 


TCCGCGACAT 


1101 


GCAAAAAGCA 


CCAAGCCCGC 


1151 


TCCGCCGTCC 


GCCCTATTGG 


1201 


AGAGGTATGC 


GTCCGCCGGC 


1251 


CATCTCgcca 


tCCAATGCGA 



i identified in N. gonorrhoeae <SEQ ED 35 1>: 

GATGCGCCTG CCCGATATTG TCGGCGTGGA 
CGGGCATTAC TGCCACAGAC ATCGTGTTGG 
AAAGAGCGCG TGGTCGGGGC GTTTGTCGAA 
AAGCCTGTCT ATCGGCGACC GCGCGACCAT 
TCGGCGCGAC TGCCGCCATG TTCGCCATCG 
TTGAAACTGA CCGGACGTGA CGACGCGCAG 
CGCCAAAACC GCAGGCTTAT GGGCAGGTGG 
CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 
CCGAGCAACC CGCACGCGCG TTTTGCCACC 
GCTGGCGAAG CCTTACGAAG AGCCTTCAGA 
CAGTGATTAT TGCCGCGATT ACTTCGTGTA 
AACGTTGTCG CCGCCGCACT GTTGGCACGC 
GAAACG C AAA CCTTGGGTGA AATCTTCGTT 
CCGGAATCTA TTTGAAAGAA GCAGGCTTGT 
GGCTTCGGTA TCGTCGCCTT CGCATGTACC 
CGCGCTcgaC CCGAAAATCC AACAAGAAAT 
CCACCGCCGT ATTGTCAGGC AACCGCAACT 
TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
ATTGGCAGGT AG CATC CGT T TCGATATTGA 
CAGACGGCCG CGAAATCCGC CTGAAAGATA 
ATCGATGCCA TCGTTGCCGA ATATGTGAAA 
TTATATCCCG ATGTCCGACA CCGGCACAGC 
TGTACGACTG GCGAC CGATG TCCACCTACA 
GAAGGCG CAC TGGCAGGGGA ACGTACATTA 
GATTTTGCCC GACAACATCA CCACCGACCA 
TTTTGGCCGG cagTGCcgca ggtgaATATT 
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13 01 TGGCGAAAAT GGGTTTGCCT GAAGAagaCT TCAACTCTTA CGCAACCCAC 

13 51 CGCGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTtcgt 
14 51 tggcacgcgT tgaacCAGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTT ATCATCATTG CCGGTGCGGA 
1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC GGCGAAGGGC GTGCGGCTGG 
1601 CGGGTGTGGA AGCCATCGCC GCCGAAGGTT TCGAGCGCAT CCACCGCACC 

16 51 AACCTCATCG GCATGGGCGT CTTGCCGCTG CAATTCAAAC CCGGCACCAA 

17 01 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 

17 51 AACGCACACC GCGCTGCGGC CTGACCCTCG TGATTCACCG TAAAAACGGA 

18 01 GAAACCGTCG AAGTTCCGGT TACCTGCCGC CCCGATACCG CAGAAGAAGC 
18 51 ATT GGTAT AT GAAGCCGGCG GCGTATTGCA ACGGTTTGCA CAGGACTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 352; ORF 099.ng>: 

g099 -pep 

1 MLGRASMMRL PDIVGVELTG KRQAGITATD IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDAQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWAGGLKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAAKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NWAAALLAR 

201 NANRLGLKRK PWVKSSFAPG SKVAGIYLKE AGLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQQEIIDR DLYATAVLSG NRNFDGRIHP YAKQ AFLASP 

301 PLWAYALAG SI RFDIENDV LGVADGRE I R LKDIWPTDEE I DAI VAEYVK 

351 PQQFRDIYIP MSDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

401 RGMRPPAILP DNITTDHISP SNAILAGSAA GEYLAKMGLP EEDFNSYATH 

4 51 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI 

501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVE AI A AEGFERIHRT 

551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCG LTLVIHRKNG 

601 ETVEVPVTCR PDTAEEALVY EAGGVLQRFA QDFLEGNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 353>: 

m099 . seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGCAGG CGGGCATTAC GG CGACGG AT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT AT CGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

251 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

3 01 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

3 51 CTTGAAAACC GCCGTTTATC CTCGCGTTTT GAAATTTGAT TTGAGCAGCG 

401 TAACGCGCAA TATGG CAGGC CCAAGTAACC CGCATGCCCG TTTTGCGACC 

451 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCGGA 

501 CGGCCAAATG CCCGACGGCT CGGTCATCAT CGCCGCGATT ACCAGTTGCA 

551 CCAACACTTC CAACCCGCGC AACGTTGTTG CCGCCGCGCT CTTGGCACGC 

601 AATGCCAACC GTCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT 

651 TGCCCCGGGT T C AAAAGT AG CCGAAATCTA TTTGAAAGAA GCGGGCCTGT 

701 TGCCCGAAAT GGAAAAACTC GG CTTCGGTA TCGTCGCCTT CGCCTGCACC 

751 AC CTGCAACG GCATGAGTGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT 

801 CATCGACCGC GATTTGTACG CCACCGCCGT AT T ATCAGGC AACCGCAACT 

851 TCGACGGCCG TATCCACCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 

901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGTATCCGTT TCGATATTGA 

951 AAACGACGTA CTCGGCGTTG CAGACGGCAA GGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTGC CGATGAAGAA ATCGATGCCG TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAGT TCCGCGATGT GTATGTACCG ATGTTCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCCAGTCCGC TGTACGATTG GCGTCCGATG TCCACCTACA 

1151 TCCGCCGTCC GCCTTACTGG GAAGGCGCGC TGGCAGGGGA ACGCACATTA 

1201 AGAGGTATGC GTCCGCTGGC GATTTTGCCC G AC AAC AT C A CCACCGACCA 

1251 CCTCTCGCCG TCCAATGCGA TTTTGGCCGT CAGTGCCGCA GGCGAGTATT 

13 01 TGGCGAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 
1351 CGCGGCGACC ACTTGACCGC CCAACGCGCT ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAAAA ACGAAGACGG CAGCGTGCGC CAAGGCTCGT 
14 51 TCGCCCGCGT CGAACCCGAA GGCGAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC ATCATCATTG CCGGTGCGGA 
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1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 

1601 CCGGCGTAGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 

1651 AACCTTATCG GCATGGGCGT GTTGCCGCTG CAGTTCAAAC CCGACACCAA 

1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTGGTCGGCG 

1751 AACGCACACC GCGCTGCGAC CTGACCCTCG TGATT CACCG TAAAAACGGC 

1801 GAAACCGTTG AAGTTCCCGT TACCTGCTGC CTCGATACTG C AG AAGAAG T 

1851 ATTGG TATAT GAAGCCGGCG GCGTGTTGCA ACGGTTTGCA CAGGATTTTT 

1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 354; ORF 099>: 

m099 .pep 

1 MLGRASMMRL PDIVGVELNG KRQAGITATD IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAAKGLAK PYEEPSDGQM PDGSVIIAAI TSCTNTSNPR NWAAALLAR 

201 NANRLGLKRK PWVKSSFAPG SKVAEIYLKE AGLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQAFLASP 

301 PLWAYALAG SI RFDIENDV LGVADGKEIR LKDIWPADEE IDAWAEYVK 

351 PQQFRDVYVP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

401 RGMRPLAILP DNITTDHLSP SNAILAVSAA GEYLAKMGLP E ED FN S Y ATH 

4 51 RGDHLTAQRA TFANPKLFNE MVKNEDGSVR QGSFARVEPE GETMRMWEAI 

501 ETYMNRKQPL IIIAGADYGQ GS SRDVJAAKG VRLAGVEAIV AEGFERIHRT 

551 NLIGMGVLPL QFKPDTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 

601 ETVEVPVTCC LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 099 shows 96.2% identity over a 639 aa overlap with a predicted ORF (ORF 099.ng) 
from N. gonorrhoeae: 

m099/g099 

10 20 30 40 50 60 

m099 .pep 
g099 



m099 .pep 
g099 



m099 .pep 
g099 



m099 .pep 
g099 



m099 -pep 
g099 

250 260 270 280 290 300 

310 320 330 340 350 360 



MLGRASMMRLPD I VGVELNGKRQAGITATD I VLALTEFLRKERVVGAFVE FFGEGARSLS 

Ml II I II MM Ml I I I M I I Mill MM I I M M I I M M I I I M M MMI M I M I 

MLGRASMMRLPD I VGVELTG KRQAGITATD I VLALTEFLRKERVVGAFVE FFGEGARSLS 
10 20 30 40 50 60 

70 80 90 100 110 120 

IGDRATISNMTPEFGATAAMFAIDEQTIDYLKLTGRDDAQVKLVETYAKTAGLWADALKT 

1 1 1 1 1 1 1 1 1 M I II 1 1 1 i M M 1 1 II 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 --III 

IGDRATISNMTPEFGATAAMFAIDAQTIDYLKLTGRDDAQVKLVETYAKTAGLWAGGLKT 
70 80 90 100 110 120 

130 140 150 160 170 180 

AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVIIAAI 

III IM II MM I MM I I M II Mill I MM Ml Ml M MM M MM M !M M M I 

AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGAVIIAAI 
130 140 150 160 170 180 

190 200 210 220 230 240 

TSCTNTSNPRNWAAALIJ^ANRLGLKRKPWKSSFAPGSKVAEIYLKEAGLLPEMEKL 

I II II I I I IMI I M I I M I II ill i II MM I I M I I I M II M I 1 1 M I I I I I 1 1 I M 

TSCTNTSNPR1TVVAAALLARNANRLGLKRKPWVKSSFAPGSKVAGIYLKEAGLLPEMEKL 
190 200 210 220 230 240 

250 260 270 280 290 300 

GFGIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASP 

1 1 1 I ! I 1 M I f 1 1 M. I M I M I 1 1 : ! ! I 1 1 1 I M I I. ! I. I I I M I 1 1 ! M I ! i M. I 1 1 ! M 

GFGIVAFACTTCNGMSGALDPKIQQEI IDRDLYATAVLSGNRNFDGRIHPYAKQAFLAS P 
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m099 . pep P LW AY ALAGS I RFD I ENDVLGV ADG KE I RLKD I W PAD EE I DAWAE YVKPQQ FRDV YV P 

1 1 1 1 1 f 1 1 ) 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i i = i r f 1 1 1 1 1 1 = 1 1 f i ) i = 1 1 1 1 1 1 1 m t f r = i = i 

g099 PLWAYALAGSIRFDIENDVLGVADGREIRLKDIWPTDEEIDAIVAEYVKPQQFRDIYIP 

310 320 330 340 350 360 



370 380 390 400 410 420 

m0 99.pep MFDTGT AQ KA P S P L YDWR PM S T Y I RR P P YWE G ALAGE RTLRGMR P LA I L PDN ITTDHLSP 

I I I I II I II Ml M I IMI II li Mi I! I il! II i I I I II II i ! MIMMMIhll 

g099 MSDTGTAQKAPS PL YDWRPMSTYI RRP P YWE G ALAGE RTLRGMR P PAI LPDNI TTDHI S P 

370 380 390 400 410 420 



430 440 450 460 470 480 

mO 99 pep SNAILAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 

MINI IIMIMIIIIIMMIIIIIIIIIMIMIMIMMMMMIHMMII 

g099 SNAILAGSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 

430 440 450 460 470 480 



490 500 510 520 530 540 

mO 9 9 pep QGSFARVEPEGETMRMWEAIETYMNRKQPLI 1 1 AGADYGQGS S RDW AAKGVRLAGVE AI V 

II MMMMMM I! ! 1 1 1 1 IMI M MM Ml 1 1 1 M II MM I! I M M MM M M ! : 

g 0 9 9 QGSLARVEPEGQTMRMWEAIETYMNRKQPLI 1 1 AGADYGQGS S RDW AAKGVRLAGVE AI A 

490 500 510 520 530 540 



550 560 570 580 590 600 

m0 99 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDWGERTPRCDLTLVIHRKNG 

II Ml II I M I I Ml I li I II Ml M II III MM ! M III IM M I II I 1 1 1 1 I i 1 1 I 

g099 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDWGERTPRCGLTLVIHRKMG 

550 560 570 580 590 , 600 



610 620 630 640 

mO 9 9 . pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

I M 1 1 II 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

g 0 9 9 ETVEVPVTCRPDTAEEALVYE AGGVLQRFAQDFLEGNAAX 

610 620 630 640 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 355>: 
a099. seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGAAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

251 ATGAGCAAAC C ATT GAT T AT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

351 CTTGAAAACC GCCGTTTATC CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 

4 01 TAACGCGCAA TATGGCAGGC CCGAGCAACC CGCACGCGCG TTTTGCGACC 

4 51 GCCGATTTGG CCGGCAAAGG CTTGGCTAAA CCTTACGAAG AGCCTTCAGA 

501 CGGCCAAATG CCTGACGGTG CAGTGATTAT TGCCGCGATT ACTTCCTGTA 

551 CCAATACTTC CAATCCGCGC AACGTTGTCG CCGCCGCGCT GTTGGCACGC 

601 AATGCCAACC GCCTCGGCTT GCAACGCAAA CCTTGGGTGA AATCTTCGTT 

651 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCAGATCTGC 

7 01 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTTGCCTT CGCATGTACC 

751 ACCTGTAACG GCATGAGCGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT 

801 CATCGACCGC GATTTGTACG CCACCGCCGT ATTGTCAGGC AACCGCAACT 

851 TTGACGGCCG TATCCATCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 

901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGCATCCGTT TCGATATTGA 

951 AAACGACGTA CTCGGCGTTG CAGACGGCAA AGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTAC CGATGAAGAA ATCGATGCCA TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAAT TTCGCGACGT TTATATCCCG ATGTTCGACA CCGGCACAGC 

1101 G C AAAAAG C A CCAAGCCCGC TGTACGACTG GCGTCCAATG TCTACCTATA 

1151 TCCGCCGCCC ACCTTACTGG GAAGGCGCAC TGGCAGGGGA AC G C AC AT T A 

1201 AGCGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 
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1251 TCTCTCGCCA TCCAATGCGA TTTTGGCAAG CAGTGCCGCA GGCGAATATT 

1301 TGGCAAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 

1351 CGTGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTTCGC 

14 51 TGGCACGCGT TGAACCCGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 

1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC AT CAT CAT TG CCGGCGCGGA 

1551 CTACGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 

1601 CCGGCGTGGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 

1651 AACTTGATCG GTATGGGCGT GTTGCCGCTG CAGTTCAAAC CGGGTACCAA 

17 01 CCGCCACACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 

17 51 AACGCACACC GCGCTGCGAC CTGACCCTTG TGATTCACCG TAAAAACGGC 
1801 GAGACCGTCG AAGTCCCCAT TACCTGCCGC CTCGATACCG CAGAAGAAGT 

18 51 GTTGGTATAT GAAGCCGGTG GCGTATTGCA ACGGTTTGCA CAGGATTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 356; ORF 099.a>: 

a099 .pep 

1 MLGRASMMRL PDIVGVELNG KRKAGITATD IVLALTEFLR KERVVGAFVE 

51 FFGEGARSLS IGDRATISNM TPE FGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAGKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NVVAAALLAR 

201 NANRLGLQRK PWVKSSFAPG SKVAEIYLKE ADLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQ AFLASP 

301 PLWAYALAG SI RFDIENDV LGVADGKEIR LKDIWPTDEE IDAIVAEYVK 

351 PQQFRDVYIP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

4 01 SGMRPLAILP DNITTDHLSP SNAILASSAA GEYLAKMGLP EEDFNSYATH 

4 51 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI 

501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT 

551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 

601 ETVEVPITCR LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 



m099/a099 97.5% identity in 639 aa overlap 

10 20 30 40 50 60 

m0 99.pep MLGRASMMRL PDIVGVELNGKRQAGI TAT DIVLALTEFLRKERWGAFVEFFGEGARSLS 

I I I I I I I I I I I I I I IS I M I II : ! I I I I I I I I I I II I I I I I I 11 I 1 I I I I II M I I I I I 1 
a09 9 M LGRAS MMRL PDIVGVELNG KRKAGITAT DIVLALTEFLRKERWGAFVEFFGEGARSLS 

10 20 30 40 50 60 

70 80 90 100 110 120 

iti0 9 9.pep IGDRAT I SNMTPE FGATAAMFAI DEQT I DYLKLTGRDDAQVKLVETYAKT AGLWADALKT 

I I I I I I I I I I I ! I I I i I I M I I I M I I I I I I I I I I I I I I I II t I I I II I I I I I I I I I I I I 
a 0 9 9 IGDRAT I SNMT PE FGATAAMFAI DEQT I DYLKLTGRDDAQVKL VET YAKT AGLWADALKT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m0 99.pep AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVIIAAI 

II I I I I I I I M I I I I I I I I I I 1 1 I I I I I I I I I I I : II I I I I I I I I I I I I I I II : M I I II 
a099 AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAGKGLAKPYEEPSDGQMPDGAVIIAAI 

130 140 150 160 170 180 

190 200 210 220 230 240 

m0 99.pep T S CTNTSNPRN WAAALLARNANRLGLKRK PWVKSS FAPGSKVAE I YLKE AGLLPEMEKL 

M I I I I I I I I M I I II I I I I I II I I I I : I I I I I I I M I I II I I I I I I I I M I I M I I I I 
a099 TSCTNTSNPRNVVAAALLARNANRLGLQRKPWVKSS FAPGSKVAE I YLKEADLLPEMEKL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m099 . pep GFGIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASP 
I I I I I I I I I I It I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I 11 1 I I I I I I I I I I I 
a099 GFGIVAFACTTCNGMSGALDPKIQKE I IDRDLYATAVLSGNRNFDGRIHP YAKQAFLASP 

250 260 270 280 290 300 



m09 9.pep 



310 320 330 340 350 360 

PLVVAYALAGS IRFD I ENDVLGVADGKE IRLKDI WPADEE I DAVVAE YVKPQQFRDVYV P 
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I I M 1 M I I I M I I I I I I ! I I I I I t I I I I I I It I M : I I I I I I : I t I ! I! I I ] I 1 I I I : I 
a09 9 PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPTDEEIDAIVAEYVKPQQFRDVYIP 

310 320 330 340 350 360 

370 380 390 400 410 420 

m09 9.pep MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPLAILPDNITTDHLSP 
I I I I I I I I I I I I I I I I I I I M I M I I N I I I I I I I I I I I I I I M I I i I I I I I I I I I I I I 
a099 MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLSGMRPLAILPDNITTDHLSP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m0 99 . pep SNAILAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 
I I I I I I I I I If I I I I I I I M I I I I I I I I ! I M I If I I I I I I I I M II I I It : I I I I I 1 I 
a099 SNAILASSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 

430 440 450 460 470 480 

490 500 510 520 530 540 

m099 . pep QGSFARVEPEGETMRMWEAIETYMNRKQPLIIIAGADYGQGSSRDWAAKGVRLAGVEAIV 
I I I : I I I I M I = I I I I I ! I M I I I I I I I I I I I I I I I I I M I I I I I M I I I M I I I I I I II 
a099 QGSLARVEPEGQTMRMWEAIETYMNRKQPLIIIAGADYGQGSSRDWAAKGVRLAGVEAIV 

490 500 510 520 530 540 

550 5 60 57 0 580 5 90 600 

m099 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDWGERTPRCDLTLVIHRKNG 
I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I i I I i I I 
a099 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDVVGERTPRCDLTLVIHRKNG 

550 560 570 580 590 600 

610 620 630 640 

mO 9 9 . pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 
I I I I I I : I I I I I I I I I II M I I I I I I II M I I I I I I I I I 
a0 99 ETVEVPITCRLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

610 620 630 640 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 35 7>: 

gl02 . seq 

1 AtgtCCGCCA AAactccgtc gctcttcggc ggcgcgatga Ttatcgccgg 

51 gaaggttatc ggcgcAGgta tgttccccaa ccccaccgcc aacttggggg 

101 acgggttaat aggctcgctg attgtgctgc tgtacacctg gtttccattc 

151 tcctccggcg ccctcatgat tttggaagtc aacacccata acCCccgagg 

201 ggcaAGtttt gacaccATGg tcAAagacct gctcgGaCGc ggctggaaca 

251 tcatcaacgg catcgccgtc gctttggTCc tatacggctc gacctacgcg 

301 tacattttag tcggcggtga cctGACCGCC AAAGGCAtcg GCAgCGCAGT 

351 AGGCGGCAAA ATTTCgctca CCGTCGGACA actcgtcttc tTCGGCATCC 

401 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTTACCGGC 

451 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GGTTGCCGAT GCCAAACCGT CCGTCCTCTT CGACACCCAA GCCCCCGTCG 

551 GCACCGGCTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 

601 TCCTTCGGCT TCCACGGCAA CGTTTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGcgcCc aaagtGgCGA aATCcatctg gGcaggtaca ttggTTGCCt 

701 tggtaattta cgtccTCTgg caaaccgcca tCcaaagcaa ccTGCcgcgc 

751 aacgagttcg cCCCcgtgat tgccgccgag aggcaactCT CCGTCCTgaa 

801 tgaaacccTG tccaaattcg cccaaaccgg cgatatggat aAaatattgt 

851 ccctatttcc ctacatggca atcgccacct cctttttagg cgTAACctta 

901 ggcctgtttg acaacatcgc cgacatcttc aaatggaacg acagtatgtc 

951 cgggcggggc accaaaaccg tcgcgctgaa cttcctgccg CCCCtgattt 

1001 cctggctgct cctccccacc ggcttcttta ccgccattgg tgcgtccggc 

1051 ctggcggcaa ccgtctggga ccaagGcatc atccccgcca tgctgctcta 

1101 cgtttccccc caaaaaattG gcGcaggcaa gacttataAa gtttaCGGCG 

1151 gcttgtggct gatgttagtc ttccttttcg gcatcgccaa catcgccgca 

1201 CAGGTATTGA GccaAatgGa ACtcgtCccc GTATTTAAAG GATAA 

This corresponds to the amino acid sequence <SEQ ID 358; ORF 102.ng>: 

gl02 .pep 
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310 



1 

51 
101 
151 
201 
251 
301 
351 
401 



MSAKT PSLFG GAMIIAGKVI GAGMFPNPTA NLGDGLIGSL IVLLYTWFPF 



SSGALMILEV 
YILVGGDLTA 
VLIGGMVLTF 



NTHNPRGASF 
KGIGSAVGGK 
IWATGGLVAD 



SFGFHGNVSS 
NEFAPVIAAE 
GLFDNIADIF 
LAATVWDQGI 
QVLSQMELVP 



LLKYFKGDAP 
RQLSVLNETL 
KWNDSMSGRG 
IPAMLLYVSP 
VFKG* 



DTMVKDLliGR GWNIING IAV ALVLYGSTYA 
ISLTVGQLVF FGILAFC VWA SARLVDRFTG 
AKPSVLFDTQ APVGTGYWIY AATALPVCLA 
KVAKSIW AGT LVALVIYVLW QTAI QSNLPR 
SKFAQTGDMD K ILSLFPYMA IATSFLGV TL 
TKTVALNFLP P LISWLLLPT GFFTAIGA SG 
QKIGAGKTYK VYGGLW LMLV FLFGIANIAA 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 359>: 



ml 02 . seq 

1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 



ATGCCCAACA 
CACGGTCATC 
TATGGTTTAC 
CTTTCCAGCG 
CGCAAGTTTC 
TCATCAACGG 
TATATCTTCG 
AGGCGGCGAC 
TCGCCTTTTG 
GTCCTTATCG 
GATTGCCGAT 
GCACAAACTA 
TCCTTCGGCT 
CGACGCGCCC 
TGGTAATTTA 
AACGAGTTCG 
CGAAACCCTG 
CCCTGTTTTC 
GGACTCTTCG 
CGGCCGCACC 
GCCTGCTCTT 
GCGGCAACCG 
GCGCAAAAAA 
GGCTGATGGT 
GTATTGAGCC 



AAACCCCTTC 
GGCGCAGGCA 
CGGCTCGCTG 
GCCTGATGAT 
GACACGATGG 
CATCGCCGTC 
TCGGCGGCGA 
GTTTCACTCA 
CGTATGGGCA 
GCGGCATGGT 
GCCAAGCCGT 
CTGGATTTAC 
TCCACGGCAA 
AAAGTGGCTA 
CGTCCTCTGG 
CCCCCGTCAT 
TCCAAATTCG 
CTATATGGCG 
ACTACATCGC 
AAAACCGCCG 
CCCCACCGGC 
TCTGGACAGG 
TTCGGCGCAG 
TTGGGTCTTC 
AAATGGAACT 



ACTGTTCGGC 
TGCTCGCCAA 
GCCGTGTTGC 
TTTGGAAGTC 
TCAAAGACCT 
GCCTTCGTTT 
CCTGACCGCC 
CCGTCGGACA 
TCCGCACGCT 
ATTGACCTTT 
CCGTCCTCTT 
GCCGCCACCG 
CGTCTCCAGC 
AATCCATCTG 
CAAACCGCCA 
CGCCGCCGAA 
CCCAAACCGG 
ATCGCCACCT 
CGACATCTTC 
CGCTGACCTT 
TTCGTTACCG 
CATCATCCCC 
GCAAAACCTA 
CTTTTCGGCA 
CGTCCCCGTA 



GGCGCGATGA 
CCCGACCGCC 
TGTACACCTG 
AACACCCATT 
GCTCGGACGC 
TATACCTGCT 
AAAGGCTTAG 
ACTCGTCTTC 
TGGTCGACCG 
ATTTGGGCGG 
CGATACCCAA 
CCCTGCCCGT 
CTGCTCAAAT 
GACGGGCACA 
TCCAAGGCAA 
GGGCAAGTCT 
CAATATGGAC 
CGTTTTTAGG 
AAATGGAACG 
CCTGCCGCCC 
CCATCGGCTA 
GCCATGCTGC 
TAAAGTTTAC 
TCGTCAACAT 
TTTAAAGGAT 



TTATCGCCGG 
ACATCCGGCG 
GTTTTCTATG 
ATCCGCACGG 
GGCTGGAACA 
TACTTACGCT 
GCAGCGCGGC 
TTCGGCATCC 
CTTCACCGGC 
CCGGCGGGCT 
GCCCCCGCCG 
CTGCCTCGCT 
ACTTTAAAGG 
CTGATTGCGC 
CCTGCCGCGC 
CCGTCCTCAT 
AAAATATTGT 
CGTAACGCTC 
ACAGCATCTC 
CTGATTTCCT 
CGTCGGCCTG 
TCTACCGTTC 
GGCGGCTTGT 
CGCCGCACAG 
AA 



This corresponds to the amino acid sequence <SEQ ID 360; ORF 102>: 

ml 02 .pep . 



1 
51 
101 
151 
201 
251 
301 
351 
401 



MPNKTPSLFG GAMIIAGTVI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM 



LSSGLMILEV 
Y I FVGGDLTA 
VLIGGMVLTF 



NTHYPHGASF 
KGLGSAAGGD 
IWAAGGLIAD 



SFGFHGNVSS 
NEFAPVIAAE 
GLFDYIADIF 
AATVWTGII P 
VLSQMELVPV 



LLKYFKGDAP 
GQVSVLIETL 
KWNDSISGRT 
AMLLYRSRKK 
FKG* 



DTMVKDLLGR GWNII NGIAV AFVLYLLTYA 
VSLTVGQLVF FGILAFC VWA SARLVDRFTG 
AKPSVLFDTQ APAGT NYWIY AATALPVCLA 
KVAKS IWTGT LIALVIYVLW QT AIQGNLPR 
SKFAQTG NMD KILSLFSYMA IATS FLGVTL 
KTAALTFLPP LISCLL FPTG FVTAIGYVGL 
FGAGKTYKVY GGLWLMVWVF LFGIVNIAAQ 



xnl02/gl02 86.0% identity in 415 aa overlap 

10 20 30 40 50 60 

ml02 .pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 
! I I I I I I I I I I I i II I I I I I I : III): I : III I I I I I I I : I :: I I I 11 I 
g!02 MSAKTPSLFGGAMIIAGKVIGAGMFPNPTANLGDGLIGSLIVLLYTWFPFSSGALMILEV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 02 .pep NTHYPHGASFDTMVKDLLGRGWNI INGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 

I M I : I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I : I I I I I I I I I : I I I : t I 
gl02 NTHNPRGASFDTMVKDLLGRGWNI INGIAVALVLYGSTYAYILVGGDLTAKGIGSAVGGK 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 02. pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
: I I I I I I I I I I I I II I I I I I I I I I I I I If I I I I I I I I I I I I I I : I I I : I I I I I E I I I I I I 
gl02 ISLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWATGGLVADAKPSVLFDTQ 

130 140 150 160 170 180 



ml 02 .pep 



190 200 210 220 230 240 

APAGTNYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 
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I I : I I : I I I I I I I I I I I I M I I I ! I I I I I I I I I I M I I I I I I I I I I I : I I I : I I I I I M I 
g l02 APVGTGYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWAGTLVALVIYVLW 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml02 . pep QTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
I | I | | : | | I I I I I I I I I I I I I : I I I I I I I I I I I I I : E I I I I I I I I I I M I I M I I I I 
g 102 QTAIQSNLPRNEFAPVIAAERQLSVLNETLSKFAQTGDMDKILSLFPYMAIATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 

ml02 . pep GLFDYIADIFKWWDSISGR-TKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVWT-GI 
MM M M M M M : M I I I I : I hi I I I I I I I I : I I I I I M I M M M I M 
g!02 GLFDNIADIFKWNDSMSGRGTKTVALNFLPPLISWLLLPTGFFTAIGASGLAATVWDQGI 

310 320 330 340 350 360 

360 370 380 390 400 410 

ml02 . pep IPAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIVNIAAQVLSQMELVPVFKGX 
M M M I I : I : I I I I I I I I I I I I I II : M I M M M M M M M M M M M I 
g 102 IPAMLLYVSPQKIGAGKTYKVYGGLWLML-VFLFGIANIAAQVLSQMELVPVFKGX 
370 380 390 400 410 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 361 >: 

al02 . seq 

1 ATGCCCACCA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 

51 CACGNTCATC GGCGCAGGTA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC TGTACACCTG GTTTTCCATG 

151 CTCTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCACT ACCCCCACGG 

2 01 CGCGANCTTC GACACCATGG TTAAAGACCT GCTCGGACGG AGCTGGAACA 

2 51 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC 7AAAGGCTTAG GCAGCGCGGC 

351 AGGCGGC7AAT GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATTC 

4 01 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG ATTCACCAGC 

4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GATTGCCGAT GCCAAACTGC CCGTCCTCTT CGACACCC7AA GCCCCTACCG 

551 GCACCAACTA CTGGATTTAT GTCGCCACCG CCCTGCCCGT CTGCCTTGCG 

601 TCATTCGGTT TCCACGGCAA CGTCTCCAGC CTGCTC7AAAT ACTTTAAAGG 

651 CGACGCGCCC 7AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

7 01 TGGT7AATTTA CGTCCTCTGG CAAACCGCCA TCCAANGCAA CCTGCCGCGC 

751 7AACGAGTTCG CCCCCGTGAT TGCCGCCG7AA GGGCAAGTCT CCGTCNTGAT 

801 TGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC 7AAAATATTGT 

851 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 

901 GGACTCTTCG ACTACATCGC CGACATCTTC A7AATGG7AACG ACAGCGTGTC 

951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCT NTAATTTCCT 

1001 GCCTGCTCTT CCCCACCGGC TTTGTTACCG CCATCGGNTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TNTACCGTTC 

1101 GCGC7AAAAAA TTCGGCGCAG GC7A7AAACCTA TAAAGTTTAC GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCNTCAACAT CGCCGCACAN 

1201 GTATTGAGCC AAATGG7AACT CGTCCCCGTA TTTAAAGGAT AA 
1202 

This corresponds to the amino acid sequence <SEQ ID 362; ORF 102.a>: 
al02 .pep 

1 MPTKTPSLFG GAMI IAGTXI GAG ML AN PT A TSGVWFTGSL AVLLYTWFSM 

51 LSSGLMILEV NTHYPHG7AXF DTMVKDLLGR SWNIINGIAV AFVLYLLTYA 

101 YIFVGGDLTA KGLGSAAGGN VSLTVGQLVF FGILAFCVWA SARLVDRFTS 

151 VLIGGMVLTF IWATGGLIAD AKLPVLFDTQ APTGTNYWIY VATALPVCLA 

201 SFGFHGNVSS LLKYFKGDAP KVAKSIWTGT LIALVIYVLW QTAIQXNLPR 

251 NEFAPVIAAE GQVSVXIETL SKFAQTGNMD KILSLFSYMA IATSFLGVTL 

301 GLFDYIADIF KWNDSVSGRT KTAALTFLPP XISCLLFPTG FVTAIGYVGL 

351 AATVWTGIIP AMLLYRSRKK FGAGKTYKVY GGLWLMVWVF LFGIXNIAAX 

4 01 VLSQMELVPV FKG* 



ml02 / a!02 



95.9% identity in 413 aa overlap 
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10 20 30 40 50 60 

ml02 . pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVW FTGSLAVLLYTWFSMLSSGLMILEV 

I I : 1 M I I II I M I I I II I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I II I I II I I 
a 102 MPTKTPSLFGGAMIIAGTXIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml02 . pep NTHYPHGASFDTMVKDLLGRGWNIINGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 
I I I I M I I 11 I I I II I I I I : I I I I I II I I I I I I I I I I I I II I I I I M I I I I I I I I I M : 
a l02 NTHYPHGAXFDTMVKDLLGRSWNIINGIAVAFVLYLLTYAYI FVGGDLTAKGLGSAAGGN 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 02 . pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
I I I I ! 1 I I I I I I I I I I I I I I I I I I M i I I : I I I I M M I If I I : I I I I I I II I I I II I 
al02 VS LT VGQL VFFG I LAFCVW AS ARLVDRFTS VL I GGMVLT FI W ATGGL I ADAKL PVL FDTQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml02 . pep APAGTNYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 
I I : I I I I II I : I I I J I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I ! II I I I I I II I 
al02 APTGTNYWIYVATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml02 . pep QTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
I It I 1 I I I I I I M I I M I I I I I I I I 1 M t I I i I I I I I I I I I I II I I M II I I I I I I I I 
a l02 QTAIQXNLPRNEFAPVIAAEGQVSVXIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml02 . pep GLFDYIADI FKWNDS ISGRTKTAALTFLPPLI SCLLFPTGFVTAIGYVGLAATVWTGI I P 

I I M I II M I I I I I I : M I I I I I I I I I I I I I I I I M I I I M I I I I I I I M I I M I I M I 
a 102 GLFDYIADI FKWNDS VSGRTKTAALTFLPPX I SCLLFPTGFVTAIGYVGLAATVWTGI IP 

310 320 330 340 350 360 

370 380 390 400 410 

ml02 . pep AMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIVNIAAQVLSQMELVPVFKGX 
| | I I I I M I I II 1 I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I I 
al02 AMLL YRS RKK FGAGKT YKV YGGLWLM VW V FL FG I XN I AAXVLS QME L V PV FKGX 

370 380 390 400 410 
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 363>: 

gl05 . seq 

1 Atgtccgcag aaaCATACAc acAAAtcggc tGGgtaggct taggGcaaat 

51 gGgtCtgcct atgGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCcgc CAAAGGAGCA 

151 AAAGTTTACG GCagcACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA AT CAT CGTCA ACATGAGCAC CATCTCCCCG 

3 01 ACCGAAAACC TCGCCGTCAA AG CACTTGTC GAAGCCGCAG GCGGACAGTT 

3 51 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

401 TGCTGATTCT GTTCGGCGGC AG CGAAG CCG TTTTAAACCC GCTGCAAAAA 

451 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 

651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 

701 C ACT C AAAC A CGCTTCCAAA GAcctTAACC TCGccgtcAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

8 01 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 
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This corresponds to the amino acid sequence <SEQ ID 364; ORF 105.ng>: 

gl05 .pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RAC PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNS LLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDS PMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 Q AGNTL PAVE TVAAS YRKAV EAGYGEQDVS GVYLKLAEH 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 365>: 



ml05 . seq 












1 


ATGTCCGCAA 


ACGAATACGC 


ACAAATCGGC 


TGGaTAGGCT 


TAGGGCAAAT 


51 


GGGTCTGCCT 


ATGGTAACGC 


GGCTCTTGGA 


CGGCGGCATC 


GAAGTCGGCG 


101 


TATACAACCG 


CTCGCCCGAC 


AAAACTGCCC 


CCATCTCCGC 


CAAAGGCGCA 


151 


AAAGTTTACG 


GCAACACCGC 


CGAACTCGTC 


CGCGACTATC 


CCGTCATTTT 


201 


CCTGATGGTT 


TCCGACTATG 


CCGCCGTGTG 


CGACATCCTG 


AACGGAGTCC 


251 


GCGACGGATT 


GGCCGGCAAm 


ATCATCGTCA 


ACATGAGCAC 


CATCTCCCCG 


301 


AC CGAAAaGC 


TCGCCGTCAA 


AGCACTTGTC 


GAAGCGCAGm 


GaCAGTTTGC 


351 


CGAAGCACCC 


GTTTCCGGAT 


CGGTCGGGCC 


CGCCACCAAC 


GGCACGCTGC 


401 


TGATTCT GTT 


CGGCGGCAGC 


GAAcCGtTTT 


AAACCCGCTG 


CAAAAAATAT 


451 


TTTCCCTCGT 


CGGCAAAAAA 


ACCTTCCATT 


TCGG CGATGT 


CGGCAAAGGT 


501 


TCGGGCGCGA 


AACTCGTCTT 


GAACTCGCTC 


TTGGGCATTT 


TCGGCGAaCG 


551 


TAcAGCGAAs 


GmTgCTGATG 


GCGCGGCAGT 


TCGGCATCGA 


TACCGACACC 


601 


ATCGTCGAAG 


CCATCGGsGA 


CTCGGCAATG 


GACTCGCCCA 


TGTTCCAAAC 


651 


CAAAAAATCC 


CTGTGGGCAA 


ACCGCGAATT 


CCCGmCCGmC 


TTCGCCCTCA 


701 


AACACGCCTC 


CAAAGACCTC 


AACCTCGCCG 


TCAAAGAGCT 


TGAACAGGCA 


751 


GGCAACACCC 


TGCCCGCCGT 


CGAAACCGTT 


GCTGCCAGCT 


ACCGCAAAGC 


801 


AGTCGAAGCC 


GGCTACGGGA 


CACAGGACGT 


TTCCGGCGTT 


TACCTGAAAC 


851 


TGGCAGAACA 


CTGA 









This corresponds to the amino acid sequence <SEQ ID 366; ORF 105>: 

ml 05. pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVI FLMV SDYAAVCDIL NGVRDGLAGX IIVNMSTISP 

101 TEKLAVKALV EAQRQFAEAP VSGSVGPATN GTLLILFGGS EPFXTRCKKY 

151 FPSSAKKPSI SAMSAKVRAR NSSXTRSWAF SANVQRXXLM ARQFGIDTDT 

201 IVEAIGDSAM DSPMFQTKKS LWANREFPXX FALKHASKDL NLAVKELEQA 

251 GNTLPAVETV AAS YRKAVE A GYGTQDVSGV YLKLAEH 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 105 shows 79.9% identity over a 289 aa overlap with a predicted ORF (ORF 105 .ng) 
from N. gonorrhoeae: 

ml05/gl05 

10 20 30 40 50 60 

gl05 .pep 
ml05 



gl05 .pep 
ml05 



gl05 .pep 
ml05 

120 130 140 150 160 170 



MS AE TYTQ I GWVGLGQMGLPMVTRLLDGGI EVGVYNRS PDKTAP I SAKGAKVYGS TAELV 
III: hMM Mil I II III i I I II! I I M IN Ml II I II II II I I I I i III :| I II I 
MSANEYAQ I GW I GLGQMGLPMVTRLLDGG I EVGVYNRS PDKTAP I S AKGAKV YGNTAELV 
10 20 30 40 50 60 

70 80 90 100 110 120 

RACPVI FLMVSDYAAVCDI LNGVRDGLAGKI I VNMSTI SPTENLAVKALVEAAGGQFAEA 

I llllllllllllllllllllllllll I IN I II II M! : I II I I M I : Mill 

RDYPVI FLMVSDYAAVCDI LNGVRDGLAGX 1 1 VNMSTI S PTEKLAVKALVEAQR - QFAEA 
70 80 90 100 110 

130 140 150 160 170 180 

PVSGS VGPATNGTLLI LFGGSEAVLNPLQKI FS LVGKKT FHFGDVG KGSGAKLVLNS LLG 

II I I I III I I I II II I II III I : :| | :|| = = = = h 
PVSGS VGPATNGTLLI LFGGSEPFXTRCKKYF PS SAKKP -SI S AMS AKVRARNSSXTRSW 
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gl05 .pep 
ml05 



250 260 270 280 289 

gl 0 5 . pep DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAGYGEQDVSGVYLKLAEH 

IIIIMIIIIMIIIIIIIMIIIIMMIIIIII IIIIMIIIIIII 

ml 0 5 DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAGYGTQDVSGVYLKLAEH 

240 250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 367>: 

al05 . seq 

1 ATGTCCGCAA ACGAATACAC 

51 GGGTCTGCCT ATGGTAACGC 

101 TATACAACCG CTCGCCCGAC 

151 AAAGTTTACG GCAACACCGC 

2 01 CCTGATGGTT TCCGACTATG 
251 GCGACGGATT GGCCGGCAAA 

3 01 ACCGAAAACC TCGCCGTCAA 
351 TGCCGAAGCA CCCGTTTCCG 

4 01 TGCTGATTCT GTTCGGCGGC 
4 51 ATATTTTCCC TCGTCGGCAA 
501 AGGTTCGGGC GCGAAACTCG 
551 AAGCGTACAG CGAAGCGATG 
601 GACACCATCG TCGAAGCCAT 
651 CCAAACCAAA AAATCCCTGT 
701 CCCTCAAACA CGCCTCCAAA 
751 CAGGCAGGCA ACACCCTGCC 
801 CAAAGCAGTC GAAGCCGGCT 
851 TGAAATTGGC AG AAC ACT G A 

This corresponds to the amino acid sequence <SEQ ID 368; ORF 105.a>: 
al05 .pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



190 200 210 220 230 240 

IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDS PMFQTKKSLWANREFP PAFALKHASK 

h = = ! I I HI i I i ! M I II I I I I I I I I ! I M I M ( I | < I I | | F MINIM 

AFSANVQRXXLMARQFGIDTDTIVEAI GDSAMDSPMFQTKKSLWANREFPXXFALKHASK 
180 190 200 210 220 230 



ACAAATCGGC 
GGCTCTTGGA 
AAAACTGCCC 
CGAACTCGTC 
CCGCCGTGTG 
ATCATCGTCA 
AGCACTTGTC 
GATCGGTCGG 
AGCGAAGCCG 
AAAAACCTTC 
TCTTGAACTC 
CTGATGGCGC 
CGGCGGCTCG 
GGGCAAACCG 
GACCTCAACC 
CGCCGTCGAA 
ACGGCGAACA 



TGGATAGGCT 
CGGCGGCATC 
CCATCTCCGC 
CGCGACTATC 
CGACATCCTG 
ACATGAGCAC 
GAAGCCGCAG 
GCCCGCCACC 
TTTTAAACCC 
CATTTCGGCG 
GCTCTTGGGC 
GGCAGTTCGG 
GCAATGGACT 
CGAATTCCCA 
TCGCCGTCAA 
ACCGTTGCTG 
GGACGTTTCC 



TAGGGCAAAT 
GAAGTCGGCG 
CAAAGGCGCA 
CCGTCATTTT 
AACGGAGTCC 
CATCTCCCCG 
GCGGACAGTT 
AACGGCACGC 
GCTGCAAAAA 
ATGTCGGCAA 
ATTTTCGGCG 
CATCGATACC 
CGCCCATGTT 
CCCGCCTTCG 
AGAGCTTGAA 
CCAGCTACCG 
GGCGTTTACC 



ml05/al05 96.5% identity in 289 aa overlap 

10 20 30 40 50 60 

ml 05 . pep MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
M I I II : I I I I I II I I I I I I I I II I I I II I I ! I I I I M I I I M | | M j | | | | | | | | | | | | 
al05 MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 

10 20 30 40 50 60 

70 80 90 100 110 119 

ml05 . pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAG-QFAEA 
M I I I M I I M I I I I I I I I I I I I I M 11 M I I I I I i I II I I I I I I I I | | | | M | 1 | | f I 
al05 RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 

120 130 140 150 160 170 179 

ml05 . pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKT FHFGDVGKGSGAKLVLNSLLG 

I M I I I I I I I I I I I I I I I I I I I M | I | || | M | | | | | | | | | | | | | | | | | j | | | | | j| | | | 
al05 PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKT FHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 

180 190 200 210 220 230 
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ml05 . pep IFGDV-QRXMLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPXAFALKHASK 
IN:: : : I 1 I II I I M I M I I I I I I I I I I I I I M I I 1 I II I I I I I i I MINIMI 
al05 IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 

240 250 260 270 280 

ml 05 .pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I M I I I M I I I I I I I I I I I I I M I M I I I I I I I I II II | | M I I I I I I I I 
al05 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 • 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 369>: 

gl05-l . seq 

1 ATGTCCGCAG AAACATACAC ACAAATCGGC TGGGTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGAGCA 

151 AAAGTTTACG GCAGCACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

2 51 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 

651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 

701 CACTCAAACA CGCTTCCAAA GACCTTAACC TCGCCGTCAA AGAGCTTGAA 

7 51 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 370; ORP 105-l.ng>: 

gl05-l .pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RACP VIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 371>: 

ml 05-1 . seq 

1 ATGTCCGCAA ACGAATACGC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

2 01 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

2 51 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCcG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAnCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGsGACTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCG CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAACTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 372; ORF 105-1>: 

ml 05-1 .pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVI FLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 
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101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEXM LMARQFGIDT 

201 DTIVEAIGDS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

ml05-l/gl05-l 96.9% identity in 289 aa overlap 

10 20 30 40 50 60 

ml 0 5 - 1 . pep MSANE YAQIGWIGLGQMGLPMVTRLLDGGI EVGVYNRS PDKTAP I SAKGAKVYGNTAELV 
III: I : I I I I : I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I It I I II : I I I I I 
gl05-l MSAETYTQI GWVGLGQMGLPMVTRLLDGGI EVGVYNRS PDKTAP I SAKGAKV YGSTAELV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 0 5 - 1 . pep RDYPVI FLMVS D Y AAVC DI LNG VRDGLAGKI I VNMSTI S PTENLAVKAL VEAAGGQFAEA 
I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I 
g 1 0 5 - 1 RAC PVI FLMVS DYAAVCDI LNG VRDGLAGKI I VNMST I S PTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml05-l.pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I 
gl05-l PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 05- 1 . pep I FGEAYSEXMLMARQFGI DTDT I VEAI GDSAMDS PMFQTKKSLWANREFPPAFALKHASK 
M I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
gl05-l I FGEAYSEAMLMARQFGI DTDT I VEAI GGSAMDS PMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 

250 260 270 280 290 

ml 05-1 .pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 1 I I I I M I I I I I I I I I I 
g 1 0 5 - 1 DLNLAVKELEQAGNTLPAVET VAAS YRKAVEAG YGEQDVSGV YLKLAEHX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 373>: 



al05-l . seq 

1 ATGTCCGCAA 

51 GGGTCTGCCT 

101 TATACAACCG 

151 AAAGTTTACG 

201 CCTGATGGTT 

251 GCGACGGATT 

301 ACCGAAAACC 

351 TGCCGAAGCA 

401 TGCTGATTCT 

4 51 ATATTTTCCC 

501 AGGTTCGGGC 

551 AAGCGTACAG 

601 GACACCATCG 

651 CCAAACCAAA 

701 CCCTCAAACA 

751 CAGGCAGGCA 

801 CAAAGCAGTC 

851 TGAAATTGGC 



ACGAATACAC 
ATGGTAACGC 
CTCGCCCGAC 
GCAACACCGC 
TCCGACTATG 
GGCCGGCAAA 
TCGCCGTCAA 
CCCGTTTCCG 
GTTCGGCGGC 
TCGTCGGCAA 
GCGAAACTCG 
CGAAGCGATG 
TCGAAGCCAT 
AAATCCCTGT 
CGCCTCCAAA 
ACACCCTGCC 
GAAGCCGGCT 
AGAACACTGA 



ACAAATCGGC 
GGCTCTTGGA 
AAAACTGCCC 
CGAACTCGTC 
CCGCCGTGTG 
ATCATCGTCA 
AGCACTTGTC 
GATCGGTCGG 
AGCGAAGCCG 
AAAAACCTTC 
TCTTGAACTC 
CTGATGGCGC 
CGGCGGCTCG 
GGGCAAACCG 
GACCTCAACC 
CGCCGTCGAA 
ACGGCGAACA 



TGGATAGGCT 
CGGCGGCATC 
CCATCTCCGC 
CGCGACTATC 
CGACATCCTG 
AC AT GAG C AC 
GAAGCCGCAG 
GCCCGCCACC 
TTTTAAACCC 
CATTTCGGCG 
GCTCTTGGGC 
GGCAGTTCGG 
GCAATGGACT 
CGAATTCCCA 
TCGCCGTCAA 
ACCGTTGCTG 
GGACGTTTCC 



TAGGGCAAAT 
GAAGTCGGCG 
CAAAGGCGCA 
CCGTCATTTT 
AACGGAGTCC 
CATCTCCCCG 
GCGGACAGTT 
AACGGCACGC 
GCTGCAAAAA 
ATGTCGGCAA 
ATTTTCGGCG 
CATC GAT ACC 
CGCCCATGTT 
CCCGCCTTCG 
AGAGCTTGAA 
CCAGCTACCG 
GGCGTTTACC 



This corresponds to the amino acid sequence <SEQ ID 374; ORF 105-La>: 

al05-l .pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRS PD KTAPISAKGA 

51 KVYGNTAELV RDY PVI FLMV SDYAAVCDIL NGVRDGLAGK 1IVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG I FGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

al05-l/ml05-l 99.0% identity in 289 aa overlap 



10 20 30 40 50 60 

a 1 05 - 1 . pep MSANEYTQIGWIGLGQMGLPMVTRLLDGGI EVGVYNRS PDKTAPI SAKGAKVYGNTAELV 
I I I I I I : I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I ] I | I | | | | 1 I I I I M I 
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ml05-l MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
10 20 30 40 50 60 

70 80 90 100 1X0 120 

al05-l pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 
I | | j I | I | | | I I 1 I I I I I I I I I I II ! I t I I i I I I t I I I I I I I I I I I M 1 I I I I I I I I I I I 
ml 0 5 - 1 RDY PVIFLMVS D YAAVCDI LNGVRDGLAGKI I VNMST I SPTENLAVKAL VEAAGGQFAEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

al05-l pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

I | I | I | | | M | I I I I I I I I I 1 I I I I I I I I 1 I I I I I I I I I i I I I I I M I I I I I i II 

ml05-l pvSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 

190 200 210 220 230 240 

al05-l pep IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 

i | | | | | | | 1 | | | 1 I I I I I I I I I I I I I 1 ! I I M MINI 

ml05-l I FGEAYSEXNLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 

250 260 270 280 290 

a 1 0 5 - 1 pep DLNLAVKELEQAGNTLP AVETVAASYRKAVEAG YGEQDVSGV YLKLAEHX 
| | | | | | | | | | I I I I I I I I I I I I I I I I I I I M I I I I t I I I I I I I I I I I I I t 
ml05-l DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 375>: 

gl07 . seq 

1 ATGGTATTAA CCTTTATTTG GGCAACCGGC GGCCTGGTTG CCGATGCCAA 

51 ACCGTCCGTC CTCTTCGACA CCCAAGCCCC CGTCGGCACC GGCTACTGGA 

101 TTTACGCCGC CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTTT CCAGCCTGCT CAAATACTTT AAAGGCGACG cgcCcaaagt 

201 GgCGAaATCc atctggGcag gtacattggT TGCCttggta atttacgtcc 

251 TCTggcaaac cgccatCcaa agcaaccTGC cgcgcaacga gttcgcCCCc 

3 01 gtgattgccg ccgagaggca actCTCCGTC CTgaatgaaa cccTGtccaa 

351 attcgcccaa accggcgata tggataAaat attgtcccta tttccctaca 

401 tggcaatcgc cacctccttt ttaggcgTAA Ccttaggcct gtttgacaac 

451 atcgccggac atcttcaaat ggaacgacag tatgtccggg cggcaccaaa 

501 accgtcgcgc tga 

This corresponds to the amino acid sequence <SEQ ID 376; ORF 107.ng>: 
gl07 -pep 

1 MVLTFIWATG GLVADA KPSV LFDTQAPVGT GYWIYAATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAP KVAKS IW AGTLVALV IYVLWQTAI Q SNLPRNEFAP 

101 VIAAERQLSV LNETLSKFAQ TGDMD KILSL FPYMAI ATS F LGV TLGLFDN 

151 IAGHLQMERQ YVRAAPKPSR * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 377>: 

ml07 . seq 

1 ATGGTATTGA CCTTTATTTG GGCGGCCGGC GGGCTGATTG CCGATGCCAA 

51 GCCGTCCGTC CTCTTCGATA CCCAAGCCCC CGCCGGCACA AACTACTGGA 

101 TTTACGCCGs CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTCT CCAGCCTGCT CAAATACTTT AAAGGCGACG CGCCCAAAGT 

2 01 GGCTAAATCC ATCTGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 
251 TCTGGCAAAC CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 

3 01 GTCATCGCCG CCGAAGGGCA AGTCTCCGTC CTCATCGAAA CCCTGTCCAA 

3 51 ATTCGCCCAA ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 
401 TGGCGATCGC CACCTCGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 

4 51 ATCGCCCATC TTCAAATGGA ACGACAGCAT CTCCGGgCCG CACCAAAACC 
501 GCCGCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 378; ORF 107>: 

ml 07 . pep . . 

1 MVLTFIWAAG GLIADA KPSV LFDTQAPAGT NYWIYAXTAL PVCLASFGFH 

51 GNVSSLLKYF KGDAP KVAKS IW TGTLIALV IYVLWQTAI Q GNLPRNEFAP 
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101 VIAAEGQVSV LIETLSKFAQ TGNMD KILSL FSYMAIATSF LGV TLGLFDY 
151 IAHLQMERQH LRAAPKPPR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 107 shows 89.4% identity over a 170 aa overlap with a predicted ORF (ORF 107.ng) 
from N. gonorrhoeae: 

ml07/gl07 

10 20 30 40 50 60 

ml 07 pep MVLTFIWAAGGLIADAKPSVLFDTQAPAGTNYWI YAXTALPVCLAS FGFHGNVSSLLKYF 

MMM IMUMMM II I II I II MM! MIMI MM M II M M MM II M M II I 

g 1 0 7 MVLTF I WATGGLVADAKPSVLFDTQAPVGTGYWI YAATALPVCLAS FGFHGNVSSLLKYF 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml07 pep KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 

Ml Ml I I I MMIIMI I I I M I I III IMMI IMMIMI II hill lllllll! 

g i07 KGDAPKVAKS I WAGTLVALV I YVLWQTAI QSNLPRNEFAPVI AAERQLSVLNETLSKFAQ 

70 80 90 100 110 120 

130 140 150 160 170 

ml07 . pep TGNMDKILSLFSYMAIATSFLGVTLGLFDYIA-HLQMERQHLRAAPKPPR 

IhlMIIII! MMIMMIMIMM 1 1 M 1 1 1 1 I = = I M 1 1 1 I 

gl 0 7 TGDMDKILSLFPYMAI ATSFLGVTLGLFDNI AGHLQMERQYVRAAPKPSR 

130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 379>: 
al07 . seq 

CCTTTATTTG GGCAACCGGC GGCCTGATTG CCGATGCCAA 

CTCTTCGACA CCCAAGCCCC TACCGGCACC AACTACTGGA 

CACCGCCCTG CCCGTCTGCC TTGCGTCATT CGGTTTCCAC 

CCAGCCTGCT CAAATACTTT AAAGGCGACG CGCCCAAAGT 

ATGTGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 

CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 

CCGAAGGGCA AGTCTCCGTC CTGATTGAAA CCCTGTCCAA 

ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 

CACCTCGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 

TCTTCAAATG GAACGACAGC GTGTCCGGCC GCACCAAAAC 

ACCTTCCTGC CGCCTCTAAT TTCCTGCCTG CTCTTCCCCA 

TACCGCCATC GGCTACGTCG GCCTGGCGGC AACCGTCTGG 

TCCCCGCCAT GCTGCTCTAC CGTTCGCGCA AAAAATTCGG 

ACCTATAAAG TTTACGGCGG CTTGTGGCTG ATGGTTTGGG 

CGGCATCGTC AACATCGCCG CACAGGTATT GAGCCAAATG 

CCGTATTTAA AGGATAA 

This corresponds to the amino acid sequence <SEQ ID 380; ORF 107. a>: 
al07 .pep 

1 MVLTFIWATG GLIADAKLPV LFDTQAPTGT NYWIYVATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IWTGTLIALV I YVLWQTAI Q GNLPRNEFAP 

101 VIAAEGQVSV LIETLSKFAQ TGNMDKILSL FSYMAIATSF LGVTLGLFDY 

151 IADIFKWNDS VSGRTKTAAL TFLPPLISCL LFPTGFVTAI GYVGLAATVW 

201 TGIIPAMLLY RSRKKFGAGK TYKVYGGLWL MVWVFLFGIV NIAAQVLSQM 

2 51 ELVPVFKG* 

ml07/al07 94.8% identity in 154 aa overlap 

10 20 30 40 50 60 

ml07 .pep MVLTFIWAAGGLIADAKPSVLFDTQAPAGTNYWI YAXTALPVCLAS FGFHGNVSSLLKYF 

I I I I I M I : I I I I I M I I I M II I I : I I I I I i I : I I I I I I I I I I I I I I I I I I I 1 I I I 
al07 MVLTFIWATGGLIADAKLPVLFDTQAPTGTNYWIYVATALPVCLASFG FHGNVSSLLKYF 

10 20 30 40 50 60 



1 


ATGGTATTAA 


51 


ACTGCCCGTC 


101 


TTTATGTCGC 


151 


GGCAACGTCT 


201 


GGCTAAATCC 


251 


TCTGGCAAAC 


301 


GTGATTGCCG 


351 


ATTCGCCCAA 


401 


TGGCGAT CGC 


451 


ATCGCCGACA 


501 


CGCCGCGCTG 


551 


CCGGCTTTGT 


601 


ACAGGCATCA 


651 


CGCAGGCAAA 


701 


TCTTCCTTTT 


751 


GAACTCGTCC 
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70 80 90 100 110 120 

ml 07 . pep KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 
| | | | M | M | I I ! II II I I I I I I I I I I I I I I I I I I M I I t I I I I I I I I I i M I M I I I I I 
al 07 KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 

70 80 90 100 110 120 

130 140 150 160 170 

ml 07 . pep TGNMDKILSLFSYMAIATSFLGVTLGLFDYIAHLQMERQHLRAAFKPPRX 

I I | I I I I I I I I I II I I I i I II I I I I I I I I I I I : 
a 107 TGNMDKILSLFSYMAIATSFLGVTLGLFDYIADIFKWMDSVSGRTKTAALTFLPPLISCL 

130 140 150 160 170 180 

al 07 LFPTGFVTAIGYVGLAATVWTGI I PAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGI V 

190 200 210 220 230 240 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 38 1>: 

gl08 . seq 

1 ATGttgccgg gCTTCAACCG GAT ATT C Aa a cggTTTGCTC CAACACTCGG 

51 AAcggCGCAT AAAACGCCgc ccTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATCCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGCG G C AAT CTTAC TCGGCGGGTG 

201 CGCCGCCGGC GGCAACACAT TCGGCAGCTT AGACGGCGGC ACGGGTATGG 

251 GTGGCAGCAT CGTCAAAATG ACGGTAGAAA gccAATGCCG TGCGGAATTG 

3 01 GACAGGCGCA GCGAATGGCG TTTGACCGCG CTGGCGATGA GTGCCGAAAA 

3 51 ACAGGCGGAA TGGGAAAACA AGATTTGCGG CTGCGCTACC GAAGAAGCAC 

4 01 CTAACCAGCT GACCGGCAAC GATGTGATGC AGATGCTGAa ccagtccacG 
4 51 CGCaatcagg cacTtgccgc CCtgaccgTC AAAacggtTT CcgcctgcTT 
501 CAaacgcctg tACCGCTAa 

This corresponds to the amino acid sequence <SEQ ID 382; ORF 108.ng>: 

gl08 -pep 

1 MLPGFNRIFK RFAP TLGTAH KTPPFALSRT GRLIRSYRHK RRGFNRKGIE 

51 MNKT LSILPA AILLGGCAAG GN TFGSLDGG TGMGGS I VKM TVESQCRAEL 

101 DRRSEWRLTA LAMSAEKQAE WENKI CGCAT EEAPNQLTGN DVMQMLNQST 

151 RNQALAALTV KTVSACFKRL YR* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 383>: 

ml08 . seq 

1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGA GGCGGTAACA CATTCGGCAG CTTAGACGGT GGCACAGGCA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTTG GGAGCCAATG CCGTGCGGAA 

301 TTGGACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGGCG GAGTGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

4 01 CACCCGAACG GATGACCGGC AACGATGTGA TGCAGATGCT GGCTCCGTCC 

4 51 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 

501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 384; ORF 108>: 

ml 0 8 -pep 

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNKT LSILPV AILLGGCAAG G GNTFGSLDG GTGMGGSIVK MAVGS QCRAE 

101 LDKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPERMTG NDVMQMLAPS 

151 TRNQALAALT AKTVSACFKH LYR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 108 shows 89.6% identity over a 173 aa overlap with a predicted ORF (ORF 108.ng) 
from N. gonorrhoeae: 

ml08/glO8 
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10 20 30 40 50 60 

ml08 pep MLPGFNRI FKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 

i 1 1 1 I I I I i I I I ^ I J t I I I I f I E 1 1 1 I 1 I 1 1 1 I I f I I I I I I I I I I I i I I I I 1 1 I ( 1 1 1 ^ 

gl08 MLPGFNRIFKRFAPTL.GTAHKTPPFADSRTGRLIRSYRHKRRGFNRKGIEMNKTLSILPA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml08 pep AILLGGCAA.GGGNTFGSLDGGTGMGGSIVKMAVGSQCRAEIiDKRSEWRLTALAMSAEKQA 

I I 1 I M I I I I I IMIIIIIIIIIIIIIIIhl 1 M I I I I M I I ! I I U I I I I I il I i 

gl o 8 AI LLGGCAAGG- NTFGSLDGGTGMGGS I VKMTVESQCRAELDRRSEWRLTALAMSAEKQA 

70 80 90 100 110 

130 140 150 160 170 

ml 0 8 . pep EWENKI CACVAQEAPERMTGNDVMQMLAPSTRNQALAALTAKTVSACFKHLYRX 

M!|| | [:| :::| |!:::||| || III | | | | | | | | | | || : | | | | | || | = | | | 
g 1 0 8 EWENKI CGCATEE APNQLTGNDVMQMLNQSTRNQALAAIiTVKTVS ACFKRL.YRX 

120 130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 385>: 

al08 . seq 

1 ATGTTGCCGG GCTTCAACCG GAT AT T C AAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGG GGCGGTAACA CATTCGGCAG CTTAGACGGC GGCACAGGTA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTAG AAAGCCAATG CCGTGCGGAA 

301 TTGAACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGGCG GAATGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

4 01 CACCCAACCA GCTGACCGGC AACGATGTGA TGCAGATGCT GGATCCGTCC 

4 51 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 

501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 386; ORF 108.a>: 

al08 .pep 

1 MLPGFNRI FK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNKT LSILPV AILLGGCAAG G GNTFGSLDG GTGMGGSIVK MAVESQCRAE 

101 LNKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPNQLTG NDVMQMLDPS 

151 TRNQALAALT AKTVSACFKH LYR* 



ml08/al08 



96.5% identity in 173 aa overlap 



10 20 30 40 50 60 

m!08 .pep MLPGFNRI FKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRG FNRKGIEMNKTLSILPV 

M M It I I I I I I i I I M I I I I I I I I I I I I I I I I I M I I I I i I I I I I I I I I I I I I I I I I I I 
al08 MLPGFNRI FKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml08 .pep AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA 
I I I I I I I I II I I I I I I I I II I I I I M I I I I I II I I I I I I I : I I I I M I I I I 1 1 I II II I 
al08 AILLGGCAAGGGNTFGSLDGGTGMGGS I VKMAVESQCRAELNKRSEWRLT ALAMSAEKQA 

70 80 90 100 110 120 



130 140 150 160 170 

ml08 . pep EWENKI CACVAQEAPERMTGNDVMQMLAPSTRNQALAALTAKTVSACFKHLYRX 

I I I I I I I I I I I I I I I ::: II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i 
a!08 EWENKI CACVAQEAPNQLTGNDVMQMLDPSTRNQALAALTAKTVSACFKHLYRX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 387>: 
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g!09 . seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GATCGTAGGC GTATGCTTAC CGCTTTTGGA AGCGGGCATG 

101 GAAATGACGC GCAAAGGCAA AACCACCCAA TCCGCCGCCA TCGTGGTGTT 

151 CTCTTCCGTC TGGTCAATCC GGTTTTCGGC TGGGCGTTGA CGATGCTGTT 

201 GGATAATTTG GGCTTAATCG GCTGCAAAGA ACGCAGCGCG CAATTAGGTT 

2 51 TTGTCGGACG AGTATTGATA CCCGCAGTAG GTTTCTTAAT CTTGTGTGTG 

3 01 GCGATGGGTG CGGTCGGGAT GCTGCCCGGT ATCCCTCCGT TTTTGGAGCA 
3 51 GTTCAAATCT TTGGGCTAG 

This corresponds to the amino acid sequence <SEQ ID 388; ORF 109.ng>: 

gl09 .pep 

1 MYYRRWGLS DGLGDLAAG I DRRRMLTAFG SGHGNDAQRQ NHPIRRHRGV 

51 LFRLVNPVFG WALTMLLDNL GLIGCKERSA QLGFVGR VLI PAVGFLILCV 

101 AMGAVGMLPG IPPFLEQFKS LG 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 389>: 

ml09 . seq 

1 ATGTATTATC GCCGGGTTAT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CATCGTGGTG TTCTCTTCCG CCTTGTCAAT CCGGTTTTCG GCTGGGCGTT 

201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGTG 

2 51 CGCAATTAGG TTTCGCCGGA CGCGTGTTGA TACCCGCAGT AGGTTTCTTG 
301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

3 51 GTTTTTGGAA CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 109>: 

ml 0 9 . pep 

1 MYYRRVMGLS DGLGDLAAG I ERSLGRRRIL TAFGSGHGND AQRQNHPIRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLI GCK ERSAQLGFAG R VLI PAVGFL 

101 ILCVAMGA VG MLPGIPPFLE HFKSLG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from AT. gonorrhoeae 

ORF 109 shows 92.9% identity over a 126 aa overlap with a predicted ORF (ORF 109.ng) 
from N, gonorrhoeae: 

ml09/gl09 

10 20 30 40 50 60 

MY YRRVMGLSDGLGDLAAG I ERSLGRRRI LTAFGSGHGNDAQRQNHP I RRHRGVLFRLVN 

I I I ! i hill I II I I M MM I h I M ! I 1 1 1 M I li 1 1 1 I II I i II I 1 1 I 1 1 I ! 

MYYRRWGLS DGLGDLAAG I DR RRMLTAFGS GHGNDAQRQNHP I RRHRGVLFRLVN 

10 20 30 40 50 

70 80 90 100 110 120 

PVFGWALTMLLDNLGLIGCKERSAQLGFAGRVLI PAVGFL I LCVAMGAVGMLPGI PPFLE 

i 1 1 I I I I 1 1 1 1 1 1 1 [ i 1 1 1 ! 1 1 1 1 1 1 1 1 = f I 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 r 

PVFGWALTMLLDNLGLIGCKERSAQLGFVGRVLI PAVGFLI LCVAMGAVGMLPGI PPFLE 
60 70 80 90 100 110 



ml09 .pep 
gl09 

ml 09 .pep 
gl09 



ml 09 . pep HFKSLGX 

= I I I I I 
gl09 QFKSLGX 

120 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 391 >: 

al09 . seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CACCGTGGTG TTCTCTTCCG CTTGGTCAAT CCGGTTTTCG GCTGGGCGTT 
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201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGCG 

251 CGCAATTAGG TTTCACCGGA CGCGTATTGA TACCCGTAGT AGGTTTCTTG 

301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAG CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 392; ORF 109>: 

al09 .pep 

1 MYYRRVVGLS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPIRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFTG R VLIPWGFL 

101 ILCVAMGAVG MLPGIPPFLE HFKSLG* 



ml09/al09 97.6% identity in 126 aa overlap 

10 20 30 40 50 60 

ml09 . pep MYYRRVMGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 
I I I I I I : I I I I I I I t I I I I I I M M I I I I I 1 I I I I ! I I I I I I I I I M II I I I I I I I I! i I 
al0 9 MY YRR WGL S DGLGD LAAG I ERS LGRRRI LT AFGSGHGN DAQRQNH P I RRHRG VL FRLVN 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml09 . pep PVFGWALTMLLDNLGLIGCKERSAQLGFAGRVLIPAVGFLILCVAMGAVGMLPGIPPFLE 
I I I I I I I I I M i I I II I I I I I 1 I I I I I I : I I I I i I : I I I I I I I I I I [ I I I I I M I I I I I I 
al09 PVFGWALTMLLDNLGLIGCKERSAQLGFTGRVLIPVVGFLILCVAMGAVGMLPGIPPFLE 

70 80 90 100 110 120 

ml 0 9. pep HFKSLGX 
I I I I I I I 

al09 HFKSLGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 393>: 

gill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGaacaaacC GCGCAaaccg 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCtATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 

201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGtccaCC TACCAGACCG 

251 ATTCCGAAAT CAGCCGGTTt atacagacan atgctggaga gctcttcgcg 

3 01 tntcatgcag nttctataac tgattccgcc gaagactgtc tgcctaatac 

351 gcctatctca tcggcgctct ga 

This corresponds to the amino acid sequence <SEQ ID 394; ORF 111 .ng>: 

gill .pep 

1 MPSETRLPNL IRALIFALGF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF IQTAGELFAH 

101 ASITDSAEDC LPNTPISSAL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 395>: 

mill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATAyCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AwAAACGCAT 

201 CGATGACGCG CTTAAAGAAk TCAACCGGyA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

3 01 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

3 51 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 
401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 
501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 
551 ATTTATCTTC GATTG CCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
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6 01 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 
651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 
701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 
751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 
801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 
851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 
901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 
951 CTTAAAGCTG GC AG AG CG CG AAAAACTCGC TGTTTTCCTG ATTGT C AGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGcTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 396; ORF 1 1 1>: 

mill -pep 

1 MPSETRLPNF IRVLIFALGF IFLNA CSEQT AQTVTLQGET MGTTYXVKYL 
51 SNNRDKLPSP AEIXKRIDDA LKEXNRXMST YQPDSEISRF NQHTAGKPLR 
101 I S SDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 
151 IKQAASYTGI DKIIL.KQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 
201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQTIVPL 
251 NNRSLATSGD YRI FHVDKNG KRLSHIINPN NKRPISHNLA SIS WADS AM 

3 01 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 
351 R* 

ORF 1 1 1 shows 88.7% identity over a 97 aa overlap with a predicted ORF (ORF 1 1 1 .ng) 
from TV. gonorrhoeae: 

mill .pep/gill .pep 

10 20 30 40 50 60 

mill . pep MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSP 

I M I I II I h I ^ I IN i i M I I I I i I IMM ! I I I M II I I I hi I i I ■ i ! I I I I I I ! 

gill MPSETRLPNLIRALIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 

70 80 90 100 110 120 

mill . pep AE IXKRIDDALKEXNRXMS T YQPDSE I SRFNQHTAGKPLRI S SDFAHVTAEAVRLNRLTH 

hi MINIMI II I M II MMMI I Mh 

gill AKIQKRIDDALKEVNRQMSTYQTDSEISRFIQTXAGELFAXHAXSITDSAEDCLPNTPIS 

70 80 90 100 110 120 

130 140 150 160 170 180 

mill . pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQI KQAAS YTG ID KI I LKQGKD YASLSKTHPK 

gill SALX 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 397>: 

alll . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TAT C AG C C CG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 
451 ATCAAACAAG CAGCATCTTA TACGGGCATA G AC AAAAT C A TTTTGAAACA 
501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 
551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 
651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG 
701 AACAGCCCAA CATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

7 51 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 
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801 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC 

8 51 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 398; ORF 1 1 l.a>: 
alll .pep 

1 MPSETRLPNF IRTLIFALSF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 I SSDFAHVTA EAVHLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKI ILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKSG KRLSHIINPN NKRPISHNLA SISVVADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 

mlll/alll 97.7% identity in 351 aa overlap 

10 20 30 40 50 60 

mill . pep MPS ETRLPN FI RVL I FALG FI FLNACSEQTAQT VTLQGETMGTT YXVKYLSNNRDKL PS P 

I M I I M I I I I I : It I i t : I I I I I I I I I I I I I I I I I I I I I I M I I : I I I I M I M I I I I I 
alll MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 11. pep AEIXKRIDDALKEXNRXMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
Ml I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I II I I M I I I : I I I I I I 
alll AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 11. pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
I | I I f I I I I I I I I I I I I I I M I I I II I I I I I I ! I I i I I ! M I I I I I I I I II I I I I I I I I I 
alll GALDVTVGPLVNLWGFGPDKSVTREPS PEQIKQAAS YTGI DKI ILKQGKDYASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

mill .pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I M M I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
alll AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGPCNARGEPWRIGIEQPNIVQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

mill, pep GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISWADSAM 
I I I I I I I I I I I I I I I I I I I I I I M I I I I : M I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
alll GGNTQI IVPLNNRSLATSGDYRI FHVDKSGKRLSHI INPNNKRPI SHNLASI SWADSAM 

250 260 270 280 290 300 

310 320 330 340 350 

mill . pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
I 1 I I II I I I I I ! I I I I I I I I I I I I I I I I II I I I I I I I M I I I I II M M I M 
alll TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 

310 320 330 340 350 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 399>: 

glll-l. seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAacCG 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 

201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TACCAGACCG 
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251 ATTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ATTTCGCACA CGTTACCGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACTCAC GGCGCACTGG ACGTAACCGT CGGCCCTTTG GTCAACCTTT 

4 01 GGGGGTTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

451 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGCAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAA GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCggcGAGTT 

651 GCACGGCAAA GGCAAAAATG CGCACGGCGA ACCGTGGCGC ATCGGTATAG 

701 AGCAACCCAA TATcatccaa ggcggcaata cgcAGattat cgtcccgctg 

751 aaCaaccgtt cgcttgccac ttccggcgAT taccgtaTTT tccacgtcgA 

801 TAAAAACGGC Aaacgccttt cccacATCAT CAATCCCAAC AACAAACGAC 

851 CCATCAGcCA CAAcctcgcc tcCATCAgCg TGGTCTCAGA CAGTGCAATG 

901 ACGGCGGACG GTTTATCCAC AGGATTATTT GTTTTAGGCG AAACCGAAGC 

951 CTTAAGGCTG GCAGAACAAG AAAAACTCGC TGTTTTCCTA ATTGTCCGGG 

1001 ATAAGGACGG CTACCGCACC GCCATGTCTT CCGAATTTGC CAAGCTGCTC 

1051 CGCTAA 



This corresponds to the amino acid sequence <SEQ ID 400; ORF 1 1 l-l.ng>: 

glll-l .pep 

1 MPSETRLPNL IRALIFALGF IEXNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILQQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNAHGEPWR IGIEQPNIIQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISWSDSAM 

301 TADGLSTGLF VLGETEALRL AEQEKLAVFL IVRDKDGYRT AMSSEFAKLL 

351 R* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 40 1>: 

mlll-l . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAACGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 

701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AG G ATT ATT C GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 



This corresponds to the amino acid sequence <SEQ ID 402; ORF 1 1 1-1>: 

ml 11-1 .pep 

1 MPSETRLPNF IRVLIFALGF IFLMAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 



mlll-l/glll-1 96.6% identity in 351 aa overlap 



10 20 30 40 50 60 

mlll-l. pep MPS ETRLPNFI RVL I FALGFI FLNACSEQTAQTVTLQGETMGTT YTVKYLSNNRDKLPS P 
I I I! II I I I : I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
glll-l MPSETRLPNLI RALI FALGFI FLNACSEQTAQTVTLQGETMGTT YTVKYLSNNRDKLPS P 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ml 11-1 .pep AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I f I I 
glll-1 AKIQKRIDDALKEVNRQMSTYQTDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

mlll-1 .pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
I I I I M I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I II I I I I I I I I i 
glll-1 GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILQQGKDYASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

mlll-1 .pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I I f I I I I I II I I 1 I I I I I I I I I I I I I I M I I ! I I I I I I I 1 I I I : I I I I I I I I I I I I I : I 
glll-1 AYLDLSSI AKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNAHGEPWRIGIEQPNIIQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

ralll-1 .pep GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAM 
I I I I I I I I I I I I I I I I I I II I II I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I : ! I I I 
glll-1 GGNTQIIVPLNNRSLATSGDYRI FHVDKNGKRLSHIINPNNKRPISHNLASISWSDSAM 

250 260 270 280 290 300 

310 320 330 340 350 

ml 1 1 - 1 . pep TADGLSTGLFVLGETEALKLAEREKLAVFLI VRDKGGYRTAMSSEFEKLLRX 
I I I I I I I I I I I I I I I I I I : I I I : I I i I I I I I I I I I I I I I I I I I I I I I I I I 
glll-1 TADGLSTGLFVLGETEALRLAEQEKLAVFLIVRDKDGYRTAMSSEFAKLLRX 

310 320 330 340 350 

glll-l/p44550 

sp| P4 4550 | YOJL_HAEIN HYPOTHETICAL LIPOPROTEIN HI0172 PRECURSOR >gi | 1074292 | pir | | C64144 
hypothetical protein HI0172 - Haemophilus influenzae (strain Rd KW20) >gi | 1573128 (U32702) 
lipoprotein, putative [Haemophilus influenzae Rd] Length =34 6 
Score = 349 bits (885) , Expect = 2e-95 

Identities = 177/328 (53%), Positives = 240/328 (72%), Gaps = 4/328 (1%) 

Query: 23 LNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSPAEIXKRIDDALKEXNRXMSTYQ 82 

L AC ++T + ++L G+TMGTTY VKYL + S + + 1+ LK+ N MSTY+ 

Sbjct: 17 LAACQKET -KVI SLSGKTMGTT YHVK YLDDGS I TATS - EKTHEEI EAI LKDVNAKMST YK 74 

Query: 83 PDSEISRFNQHT-AGKPLRISSDFAHVTAEAVRLNRLTHGALDVTVGPLVNLWGFGPDKS 141 

DSE+SRFNQ+T P+ IS+DFA V AEA+RLN++T GALDVTVGP+VNLWGFGP+K 

Sbjct: 75 KDSELSRFNQNTQVNTPIEISADFAKVLAEAIRLNKVTEGALDVTVGPVVNLWGFGPEKR 134 

Query: 142 VTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPKAYLDLSSIAKGFGVDKVAGEL 201 

++P+PEQ+ + ++ GIDKI L K+ A+LSK P+ Y+DLSSIAKGFGVD+VA +L 
Sbjct: 135 PEKQPTPEQLAERQAWVGIDKITLDTNKEKATLSKALPQVYVDLSSIAKGFGVDQVAEKL 194 

Query: 202 EKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQGGNTQIIVPLNNRSLATSGDY 261 

E+ QNY+VEIGGE+ KGKN G+PW+I IE+P + ++ LNN +A+SGDY 

Sbjct: 195 EQLNAQNYMVEIGGEIRAKGKNIEGKPWQIAIEKPTTTGERAVEAVIGLNNMGMASSGDY 254 

Query: 262 RIFHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAMTADGLSTGLFVLGETEALKLA 321 

RI+ ++NGKR +H I+P PI H+LASI+V+A ++MTADGLSTGLFVLGE +AL++A 
Sbjct: 255 RI Y-FEENGKRFAHEIDPKTGYPIQHHLASITVLAPTSMTADGLSTGLFVLGEDKALEVA 313 

Query: 322 EREKLAVFLIVRDKGGYRTAMSSEFEKL 34 9 

E+ LAV+LI+R G+ T SS F+KL 
Sbjct: 314 EKNNLAVYLI IRTDNGFVTKSSSAFKKL 341 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 403>: 

alll-l . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 
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351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 



CCTGACACAC 
GGGGATTCGG 
ATCAAACAAG 
AGGCAAAGAT 
ATTTATCTTC 
C T GG AAAA AT 
GCACGGCAAA 
AACAGCCCAA 
AACAACCGTT 
TAAAAGCGGC 
CCATCAGCCA 
ACGGCGGACG 
CTTAAAGCTG 
ATAAAGGCGG 
CGCTAA 



GGCGCGCTGG 
CCCCGACAAA 
CAGCATCTTA 
TACGCTTCCT 
GATTGCCAAA 
ACGGCATTCA 
GGCAAAAACG 
CATCGTCCAA 
CGCTTGCCAC 
AAACGCCTCT 
CAACCTCGCC 
GCTTGTCCAC 
GCAGAGCGCG 
CTACCGCACC 



ACGTAACCGT 
TCCGTTACCC 
TACGGGCATA 
TGAGCAAAAC 
GGCTTCGGCG 
AAATTATCTG 
CGCGCGGCGA 
GGCGGCAATA 
TTCCGGCGAT 
CCCATATCAT 
TCCATCAGCG 
AGGATTATTC 
AAAAACTCGC 
GCCATGTCTT 



CGGCCCCTTG 
GTGAACCGTC 
GACAAAATCA 
CCACCCCAAG 
TTGATAAAGT 
GTCGAAATCG 
ACCTTGGCGC 
CGCAGATTAT 
TACCGTATTT 
TAATCCGAAC 
TGGTCGCAGA 
GTATTGGGCG 
TGTTTTCCTG 
CCGAATTTGA 



GTCAACCTTT 
GCCGGAACAA 
TTTTGAAACA 
GCCTATTTGG 
TGCGGGCGAA 
GCGGCGAGTT 
ATCGGCATCG 
CGTCCCGCTG 
TCCACGTCGA 
AACAAACGAC 
CAGTGCGATG 
AAACCGAAGC 
ATTGTCAGGG 
AAAACTGCTC 



This corresponds to the amino acid sequence <SEQ ID 404; ORF 1 1 1-1. a>: 



alll-l -pep 



MPSETRLPNF IRTLIFALSF I FLKACS EQT 



51 SNNRDKLPSP 

101 ISSDFAHVTA 

151 IKQAASYTGI 

201 LEKYGIQNYL 

251 NNRSLATSGD 

301 TADGLSTGLF 

351 R* 



AEIQKRIDDA 
EAVHLNRLTH 
DKIILKQGKD 
VEIGGELHGK 
YRIFHVDKSG 
VLGETEALKL 



LKEVNRQMST 
GALDVTVGPL 
YASLSKTHPK 
GKNARGEPWR 
KRLSHIINPN 
AEREKLAVFL 



AQTVTLQGET 
YQPDSEISRF 
VNLWGFGPDK 
AYLDLSSIAK 
IGIEQPNIVQ 
NKRPISHNLA 
IVRDKGGYRT 



MGTTYTVKYL 
NQHTAGKPLR 
SVTREPSPEQ 
GFGVDKVAGE 
GGNTQIIVPL 
SIS WADS AM 
AMSSEFEKLL 



alll-l/mlll-1 



98.9% identity in 351 aa overlap 



10 20 30 40 50 60 

alll-l. pep MPSETRLPNFIRTLIFALSFI FLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
I I 1 I I I I I I I I I : I I I I I : I I I I I I I 1 I I I I I I I I 1 I I I I I I I I I I I I I I I I f I I I 1 1 I I 
ml 11-1 MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 

70 80 90 100 110 120 

all 1-1. pep AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 
I I I I I I I M I I I I I I I 1 I I I I I I I t M I II I II I 1 I I I I I I I I I ! I I I I I I I I : I I I I i 1 
ml 11-1 AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

alll-l - pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
i I I I I II I II I I I I I i I 1 I! I M I I I I I I I I I I I I I I I I i I I I I I I I M II II I I I I I I I 
mil 1-1 GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKI ILKQGKDYASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

alll-l .pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I I I I I I I I I I II I I I 
ml 1 1 - 1 AYLDLS S I AKGFGVDKVAGELEKYGI QN YLVEIGGELHGKGKNARGEPWRI GI EQPNI VQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

alll-l. pep GGNTQIIVPLNNRSLATSGDYRI FHVDKSGKRLSHI INPNNKRPISHNLASISVVADSAM 
t I I I I I I I I I I I I 1 I I II I I I I I I I I II : I I I I I I I I I I I I I I I I I I I I I II I I I I I 11 I 
mil 1-1 GGNTQI IVPLNNRSLATSGDYRI FHVDKNGKRLSHIINPNNKRPISHNLASISVVADSAM 

250 260 270 280 290 300 



310 320 330 340 350 

alll-l. pep T ADGLSTGLFVLGETEALKLAEREKLAVFLI VRDKGG YRT AMS SEFEKLLRX 
I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I 
ml 1 1 - 1 T ADGLSTGLFVLGETEALKLAEREKLAVFLI VRDKGG YRT AMS SEFEKLLRX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 405>: 

gll4 . seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCAGCAGG AATGCAGCAA 
51 GACTTTTTTA TGTCCGCCGG GCGGGACGAG TATGGGGCGG TCAATGTCGG 
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101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTCGAA 

151 TACGGTCAAA GCGGCTATTT TACCAGAGCC G C CGAATGT A AAACAGGGTG 

201 TCAGGGCATC AGCCCGAGCT GCCTGAACGA ACGGACGGTT TGCGAGGTAA 

2 51 CGATAAAATG GTCGAGCAGC GAAACATCAA CCAGCGACAT GGCCTGTGCC 

3 01 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAAGGTTCAG GCGAGCCGCC 

3 51 CGGATGGTTG TGCGCGATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

4 01 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 406; ORF 1 14.ng>: 

gl!4 .pep 

1 MASITSPLHG AQQECSKTFL CPPGGTSMGR S MSVTVGLFC VSINLTISV E 

51 YGQSGYFTRA AECKTGCQGI SPSCLNERTV CEVTIKWSSS ETSTSDMACA 

101 SRLVNMMSSC EGSGEPPGWL CAIIRLSAYS SNASLTISRM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 407>: 



mll4 . seq 






1 


ATGGCTTCCA 


TCACTTCGCC 


51 


GACTTTTTTA 


TGTCCACCGG 


101 


TAACGGTAGG 


TTTGTTTTGT 


151 


TACGGTTGAA 


GCGGCTATTT 


201 


TCAGGGCATC 


AACCCGAGCT 


251 


CGATAAAATG 


GTCGAGCAGC 


301 


AGCCGCCTTG 


TGAACATGAT 


351 


CGgATGGTTG 


TGCGCAATAA 


401 


GTTTGACGAT 


TTCGCGGATG 



This corresponds to the amino acid sequence <SEQ ID 408; ORF 1 14>: 

ml 14 -pep 

1 MASITSPLHG AHRECSKTFL CPPGGTSIGR S MSVTVGLFC VSINLTISV E 

51 YGXSGYFIRA AACKTECQGI NPSCLNEQTL CXVTIKWSSS DTSTSDIACA 

101 SRLVNMMSSC EXSGEPPGWL CAIIRLSAYS SNASLTISRM * 



mll4/gll4 90.0% identity over a 140 aa overlap 

10 20 30 40 50 60 

mll4 . pep MASITSPLHGAHRECSKTFLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSGYFIRA 

!)MIIIIII|::|I!IIIII!II1I|:|1I!IIMII!1IIII!I1IIIII 1 1 1 1 II 
gll4 MAS I TSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLT I SVE YGQSGYFTRA 

10 20 30 40 50 60 

70 80 90 100 110 120 

mll4 .pep AACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGEPPGWL 

I III Nihil I llh hi lllllllhllllhllllllllllllll MM III I 

gll4 AECKTGCQGI SPSCLNERTVCEVTI KWSSSETSTSDMACASRLVNMMSSCEGSGEPPGWL 

70 80 90 100 110 120 



130 140 
mll4 -pep CAIIRLSAYSSNASLTISRMX 

Mill 1 1 1 II I II I M MM 

gll4 CAIIRLSAYSSNASLTISRMX 

13 0 14 0 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 409>: 

all4 . seq 

i ATGCCGGAGG CAAGCAT CGC CTCCATCACT TCGCCGCTGC ACGGGGCGCA 

51 ACAGGAATGC AGCAAGACTT TTTTATGTCC GCCGGGCGGG ACGAGTATGG 

101 GGCGGTCAAT GTCGGTAACG GTAGGTTTGT TTTGTGTTTC CATTAACTTA 

151 ACGATATCTG TCGAATACGG TTGAAGCGGC TATTTTATCA GAGCCGCCGC 

201 ATGTAAAACA GGGTGTCAGG GCATCAGCCC GAGCTGCCTG AACGAACGGA 

251 CGGTTTGCGC CGTTACGATA AAATGGTCGA GCAGCGACAC AT CG AC C AG C 

301 GACATTGCCT GTGCCAGCCG CCTTGTGAAC ATGATGTCTT CCTGCGAAGG 

351 TTCGGGCGAG CCGCCCGGAT GGTTGTGCGC GATAATCAGG CTGTCGGCAT 

4 01 ATTCGTCCAA TGCCAGTTTG ACAATTTCAC GGATGTAA 



WO 99/57280 



PCTAJS99/09346 



329 



This corresponds to the amino acid sequence <SEQ ID 410; ORF 1 14.a>: 

all4 .pep 

1 MPEASIASIT SPLHGAQQEC SKT FLCPPGG TSMGRSM SVT VGLFCVSINL 

51 TI5V EYG*SG YFIRAAACKT GCQGISPSCL NERTVCAVTI KWSSSDTSTS 

101 DIACASRLVN MMSSCEGSGE PPGWLCAIIR LSAYSSNASL TISRM* 

mll4/all4 92.9% identity in 140 aa overlap 

10 20 30 40 * 50 

ml 14 .pep MASITSPLHGAHRECSKT FLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSG 

: 1 I M I I t I I I : : I M I 1 I j M I I I i I : I I I I I I M I M I I I I I I I I I I I j ! | M 
all 4 MPEASIASITSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLTISVEYGXSG 

10 20 30 40 50 60 

60 70 80 90 100 110 

ml 14 .pep YFIRAAACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGE 
MINIMI! M M : M M M : I : I M I M M I I M 11 M M M I M M M M I Ml 
all 4 YFIRAAACKTGCQGISPSCLNERTVCAVTIKWSSSDTSTSDIACASRLVNMMSSCEGSGE 

70 80 90 100 110 120 

120 130 140 

ml 14 .pep PPGWLCAIIRLSAYSSNASLTISRMX 
M M M M M M M M M M M M M 
all4 PPGWLCAIIRLSAYSSNASLTISRMX 

130 140 



The following partial DNA sequence 

gll7 . seq 



1 


atggtcgacg 


aactcgacCT 


51 


TGCCGACATC 


GGACGCTACG 


101 


GCTGCAACAG 


CACCGTCGCC 


151 


AAACTTACCC 


ACTTCGCCCG 


201 


CGCACAGCAA 


GCGGAAACCA 


251 


Acatccgcgt 


cgtaTTAATC 


301 


ttTTtaaGCA 


ACGCCCCCGA 


351 


aacccTCGAC 


ATCTTCGCCC 


401 


TCAAATGGCA 


GCTCGAAGAT 


451 


TACCGCGAAA 


TCGCCCTGCT 


501 


ATACATCGAA 


AACTTCCTCG 


551 


ATATCCACTT 


TGAAGTCGCC 


601 


AAAAAAATGG 


TGAAGAAAAA 


651 


CGCCGTGCGG 


ATTCTGGTCG 


701 


gcaTCGTCCA 


CAGCCTCTGG 


751 


ATCGCCAACC 


CCAAAGg c a A 


801 


cggcccGGAa 


gacaaaggtg 


851 


accAATTCaa 


CgaatTcggT 


901 


ggcaaaggcg 


attccGCCtA 


951 


CTTGGACTGG 


CGCGAAAATA 


1001 


CCGCCTTCAA 


AACCGAGCTT 


1051 


CACGGCAAAG 


TCCTCTCTCT 


1101 


CTACGCCCTG 


CACAGCAG CA 


1151 


AaggGCAGAT 


TGTGCCGCTG 


1201 


GAAATcatta 


cCGCcaaAGA 


1251 


AGGctgGGtc 


aAATCCGGCA 


1301 


GCCAGcaaAa 


cgCcgaCACC 


1351 


AAGCAGCTTG 


CCAAACTCAC 


1401 


aaATCTCGGC 


tacaaAAAGC 


1451 


gcgaaatttc 


caaccgcgcc 


1501 


ccgccccCCG 


TGCCCGTCAG 



; identified in N. gonorrhoeae <SEQ ID 41 1>: 

GCTGCC CGAT GCCGTCGCCG CCACCCTGCT 
TCCCCGATTG GAACCTATTG GTTTCCGAGC 
GAGCTGGTCA AAGGTGtgga CGAAGTG CAG 
GGTGGACAGC CTCGCCACGC CGGAAGAACG 
TGCGGAAAAT GCTGCTGGCg atggttaccg 
AAACTGGCGA TGCGTacgcg caccCTGcta 
CAGCC CTGAA AAACg eg ccG TCgccaaAga 
CGCTCGCCAA CCGCTTGGGC GTGTGGCAGC 
TTGGGCTTCC GCCATCAAGA ACC CGAAAAA 
TTTGGACGAA AAACGCACCG AACGCCTCGA 
ATATCCTGCG TACGGAACTC AAAAAATACA 
GGCCGTCCGA AACACAT CTA CTCCATTTAC 
ACTCAGCTTC GACGgccTGT TCGACAT CCG 
ATACCGTCCC CGaGTGTTAC ACCACGCTGG 
CAGCCCATTC CCGGCGagtt CGAcgactAC 
CGg 1 1 AT AAA AGtTTGCACA CCGTCATCGT 
tggaaGtgCA AATCCGCACC TTCGAtatGC 
gtcgccgCCC ACTGGCGtta caaagaaggc 
cgaacaaAAA ATcgccTggt TGCgccaACT 
TGG CGGAAAG CGGCAAGGAA GACCTCGCCG 
TT CAACGAC A CGAT TTATGT TTTGACCCCG 
GCCAACGGGC GCAACCCCCA TCGACTTCGC 
TcggCGACCG CTGCCGGGGC GCGAAAGTCG 
TCCACCCCGC TCGAAAACGG ACAGCGCGTC 
AGGG CATCCT TCCGTCAACT GGCTTTACGA 
AGGC CATCGG caaAATCCGC GCCTAcatCC 
GTGCGCGAAG AAGGCCGTGT CCAACTCGAC 
GCCCAAACCC AACCTGCAAG AGCTTgccga 
cagaagacct ctacacCGCc gtcggacaag 
atCcaaaaag cctgcggcac GCTgaacgaa 
CGCAACCACC ATCGTCAAAC AGTCCAAAAT 
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15 51 CAAAAAAGGT GGCAAAACCG GCGTGCTCAT CGACGGCGAA GACGGCTTGA 

16 01 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGATATTGCC 
16 51 GGCTTCGTTA CCCGCGAGCG CGGCATTTCC GTCCACCGCA AAACCTGCCC 
1701 CTCTTTCCGA CACCTTGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 
1751 GTTGGGCGGC GTTGCAGGAA GGGCAAGTGT TCGCCGTCGA TATCGAAATC 
18 01 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 
1851 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 
1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGtCAA CGacCTCCCG 
1951 CGCGTCCTCG CCGGCCTCGG CGATGTCAAA GGCGTATTGA GCGTTACCCG 
2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 412; ORF 1 17.ng>: 

gll7 .pep 

1 MVDELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMRKMLLA MVTDIRWLI KLAM RTRTLL 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YRE I ALLLDE KRTERLEYIE NFLDILRTEL KKYNIHFEVA GRPKHIYSIY 

2 01 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

2 51 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

3 01 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

3 51 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 

4 01 EIITAKEGHP SVNWLYEGWV KSGKAIGKIR AYIRQQNADT VREEGRVQLD 
451 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISKTRA IQKACGTLNE 

5 01 PPPVPVSATT IVKQSKIKKG GKTGVLIDGE DGLMTTLAKC CKPAPPDDIA 
551 GFVTRERGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 
601 RAQDRSGLLR DVSDALARHK LNVTAVQTQS RDLEASMRFT LEVKQVNDLP 
651 RVLAGLGDVK GVLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 413>: 

ml!7.seq (partial) 

1 . . GTGAAACTCA AGAAATACAA TGTCCATTTC GAAGTCGCCG GCCGCCCGAA 

51 ACACATCTAC TC C ATTTACA AAAAAATGGT GAAGAAAAAA CTCAGCTTCG 

101 ACGGCCTCTT TGACATCCGC GCCGTGCGAA TTCTGGTTGA TACCGTCCCC 

151 GAGTGTTACA CCACGCTGGG TATCGTCCAC AGCCTCTGGC AGCCCATTCC 

2 01 CGGCGAGTTC GACGACTACA TCGCCAATCC CAAAGGCAAC GGCTATAAAA 

2 51 GTTTG CAC AC CGTCATCGTC GGCCCGGAAG ACAAAGG CGT GGAAGTACAA 

3 01 ATCCGCACCT TCGATATGCA CCAATTCAAC GAATTCGGTG TCGCCGCCCA 

3 51 C TGg CGTTAC AAAGAGGGCG GCAAGGGCGA TTCCGCCTAC GAACAGAAAA 

4 01 TCGCCTGGTT GCGCCAACTC TTGGACTGGC GCGAAAACAT GGCGGAAAGC 

4 51 GGCAAGGAAG ACCTCGCCGC CGCCTTCAAA ACCGAGCTTT TCAACGACAC 
501 GATTTATGTT TTGACCCCGC ACGGCAAAGT CCTCTCCCTG CCCACGGGCG 

5 51 CGACCCCCAT CGACTTCGCC TACGCCCTGC ACAGCAGCAT CGGCGACCGT 

6 01 TGCCGCGGTG CGAAAGT CGA AGGGCAGATT GTGCCGCTGT CCACCCCGCT 
651 CGAAAACGGA CAGCGCGTCG AAATCATTAC CGC CAAAGAA GGGCATCCTT 

7 01 CCGTCAACTG GCTTTACGAA GGCTGGGTCA AATCCAACAA GGCAATCGGC 
751 AAAATCCGCG CCTACATCCG CCAGCAAAAC GCCGACACCG TGCGCGAAGA 

8 01 AGGCCGCGTC CAACTCGACA AACAGCTTGC CAAACTCACG CCCAAACCCA 
8 51 ACCTGCAAGA GCTTGCCGAA AATCTCGGCT AC AAAAAG C C AGAAGACCTC 
901 TACACCGCCG TCGGACAAGG CGAAATTT CC AACCGCGCCA T CCAAAAAGC 
951 CTGCGGCACg CTGAACGAAC CGCCGCCCGT ACCCGTCAGC GAAACCACCA 

1001 TCGTCAAACA GTCCAAAATC AAAAAAGGCG GCAAAAACGG CGTGCTCATC 

1051 GACGGCGAAG ACGGTCTGAT GACCACGCTT GCCAAATGCT GCAAACCCGC 

1101 GCCGCCCGAC GAT AT TAT CG GCTTCGTTAC CCGCGAGCGC GgCATTTCAG 

1151 TGCACCGCAA AwyyTkCyCG TCTTTCCAAC ACCTCGCCGA ACACGCGCCC 

1201 GAwAAAGTGC TGGACGCAAG CTGGGCGGCA TTGCAGGAAG GACAAGTATT 

1251 CGCCGTCGAT ATCGAAATCC GCGCCCAAGA CCGCTCCGGG CTTTTGCGCG 

13 01 ACGTAT CCG A CGCGCTCGCC CGCCACAAAC TCAACGTTAC CGCCGTGCAA 

13 51 ACCCAGTCCC GCGACTTGGA AGCCAGCATG AGGTTCACGC TCGAAGTCAA 

14 01 ACAAGTCAAC GACCTCCCGC GCGTCCTCGC CAGCCTCGGC GACGTCAAAG 
14 51 GCGTATTGAG CGTTACCCGG CTTTAA 

This corresponds to the amino acid sequence <SEQ ID 414; ORF 1 1 7>: 

mll7.pep (partial) 
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1. . . .VKL.KKYNVHF EVAGRPKHIY SIYKKMVKKK LSFDGLFDIR AVRILVDTVP 

51 ECYTTLGIVH SLWQPIPGEF DDYIANPKGN GYKSLHTVIV GPEDKGVEVQ 

101 IRTFDMHQFN EFGVAAHWRY KEGGKGDSAY EQKIAWLRQL LDWRENMAES 

151 GKEDLAAAFK TELFNDTIYV LTPHGKVLSL PTGATPIDFA YALHSSIGDR 

2 01 CRGAKVEGQI VPLSTPLENG QRVEIITAKE GHPSVNWLYE GWVKSNKAIG 

251 KIRAYIRQQN ADTVREEGRV OLDKQIiAKLT PKPNLQELAE NLGYKKPEDL 

301 YTAVGQGEIS NRAIQKACGT LNEPPPVPVS ETTIVKQSKI KKGGKNGVLI 

351 DGEDGLMTTL AKCCKPAPPD DI IGFVTRER GISVHRKXXX SFQHLAEHAP 

401 X KVLD AS W AA LQEGQVFAVD IEIRAQDRSG LLRDVSDALA RHKLNVTAVQ 

451 TQSRDLEASM RFTLEVKQVN DLPRVLASLG DVKGVLSVTR L* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 117 shows 97.6% identity over a 490 aa overlap with a predicted ORF (ORF 1 17.ng) 
from TV. gonorrhoeae: 

mll7/gll7 

10 20 30 

mll7 pep VKLKKYNVHFE VAGRPKH I YS I YKKMVKKKL 

:||||!:|IIIIMIIMIIIIIIIIMII 
gH7 EKYREIALLLDEKRTERLEYIENFLDILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKL 
150 160 170 180 190 200 

40 50 60 70 80 90 

mll7 pep S FDGL FD I RAVR I L VDTV P E C YTTLG I VH S LWQ P I P GE FDD Y I AN P KGNG YKS LHTV I VG 

III 1 1 1 III MM Ml 1 1 1 MM II 1 1 I I II 1 1 M M MMIMMM M IMM I M II 

g il7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
210 220 230 240 250 260 

100 110 120 130 140 150 

mll7 pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 

MIMI Ml IM MMM IMM II MMMMIMM Mill M IIMIIIIMIM II 

g ll7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 

160 170 180 190 200 210 

mll7 .pep KEDLiAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYAliHSSIGDRCRGAKVEGQIV 

II M 1 1 M II 1 1 1 1 M I M 1 1 1 I II I M II M M I M II II II II I M M M I II I M II 

g 1 1 7 KEDIiAAAFKTELFNDTI YVLTPHGKVLSLPTGATP IDFAYALHS S I GDRCRGAKVEGQI V 

330 340 350 360 370 380 

220 230 240 250 260 270 

mll7.pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 

MM M ! II I II I MM II MM I I I I Mill I MM Ml M I II I Mill I III Ml I I 

g il7 PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSGKAIGKIRAYIRQQNADTVREEGRVQ 
390 400 410 420 430 440 

280 290 300 310 320 330 

ml 17 .pep LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGE I SNRAIQKACGTLNEPPPVPVSE 

M II M I M I II 1 1 M II M II I 1 1 II M M II II M 1 1 M M M II II I II II M II I 

g 1 1 7 LDKQlxAKLTPKPNLQELAENLGYKKPEDLYTAVGQGE I SNRAI QKACGTLNEPP PVPVS A 

450 460 470 480 490 500 

340 350 360 370 380 390 

ml 17 . pep TTIVKQSKI KKGGKNGVLI DGEDGLMTTLAKCCKPAPPDD I IGFVTRERGI SVHRKXXXS 

I MM !! Ml II I I Mi I I I II I ! I I II II II II I III III I M I MM Mill M I 

gll7 TTIVKQSKIKKGGKTGVLIDGEDGLMTTLAKCCKPAPPDDIAGFVTRERGISVHRKTCPS 
510 520 530 540 550 560 



mll7 . pep 



400 410 420 430 440 450 

FQHLAEHAPX KVLDAS WAALQEGQVFAVD I E I RAQDRSGLLRDVSDALARHKLNVTAVQT 
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mll7 . pep 
gll7 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 415>: 
all7 . seq 

1 ATGGTTCATG AACTCGACCT GCTCCCCGAT GCCGTCGCCG CCACCCTGCT 

51 TGCCGACATC GGACGCTACG TCCCCGACTG GAACCTATTG GTTTCCGAAC 

101 GCTGCAACAG TACCGTCGCC GAGCTGGTCA AAGGTGTGGA CGAAGTGCAG 

151 AAACTCACCC ACTTCGCCCG GGTGGACAGC CTCGCCACGC CGGAAGAACG 

201 CGCCCAGCAG GCAGAAACTA TGCGGAAAAT GCTGCTGGCG ATGGTTACCG 

251 ACATCCGCGT CGTGTTAATC AAACTGGCGA TGCGTACGCG CACCCTGCAA 

301 TTTTTAAGCA ACGCCCCCGA CAGCCCCGAA AAACGCGCCG TCGCCAAAGA 

351 AACCCTCGAC ATCTTCGCCC CGCTCGCCAA CCGTTTGGGC GTGTGGCAGC 

4 01 TCAAATGGCA GCTCGAAGAT TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 

4 51 TACCGCGAAA TCGCCCTGCT TTTGGACGAA AAACGCACCG AACGCCTCGA 

501 ATACATCGAA AACTTCCTTA ATATCCTGCG TACGGAACTC AAAAAATACA 

551 ATATCCACTT TGAAGTCGCC GGCCGTCCGA AACACATCTA CTCCATTTAC 

601 AAAAAAATGG TGAAGAAAAA ACTCAGCTTC GACGGGTTGT TCGACATCCG 

651 CGCCGTGCGG ATTCTGGTTG ATACCGTCCC CGAGTGTTAC ACCACACTGG 

701 GCATTGTCCA CAGCCTCTGG CAGCCCATTC CCGGCGAGTT CGACGACTAC 

7 51 ATCGCCAACC CGAAAGGCAA CGGCTATAAA AGTTTGCACA CCGTCATCGT 

801 CGGCCCGGAA GACAAAGGCG TGGAAGTGCA AATCCGCACC TTCGATATGC 

851 ACCAATTCAA CGAATTCGGT GTCGCCGCGC ACTGGCGTTA CAAAGAGGGC 

901 GGCAAAGGCG ATTCCGCCTA CGAACAAAAA ATCGCCTGGT TACGCCAACT 

951 TTTGGACTGG CGCGAAAACA TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 

1001 CCGCCTTCAA AACCGAGCTT TTCAACGACA CGATTTATGT TTTGACCCCG 

1051 CACGGCAAAG TCCTCTCCCT GCCCACAGGC GCGACCCCCA TCGACTTCGC 

1101 CTACGCCCTG CACAGCAGCA TCGGCGACCG TTGCCGCGGT GCGAAAGTCG 

1151 AAGGGCAGAT TGTGCCGCTG TCCACCCCGC TCGAAAACGG ACAGCGTGTC 

1201 GAAATCATTA CCGCCAAAGA AGGGCATCCT TCCGTCAACT GGCTTTACGA 

1251 AGGCTGGGTC AAATCCAACA AGGCAATCGG CAAAATCCGC GCCTACATCC 

1301 GCCAGCAAAA CGCCGACACC GTGCGCGAAG AAGGCCGCGT CCAACTCGAC 

1351 AAACAGCTTG CCAAACTCAC GCCCAAACCC AACCTGCAAG AGCTTGCCGA 

1401 AAATCTCGGC TACAAAAAGC CAGAAGACCT CTACACCGCC GTCGGACAAG 

14 51 GCGAAATTTC CAACCGCGCC AT C C AAAAAG CCTGCGGCAC GCTGAACGAA 

1501 CCGCCGCCCG TACCCGTCAG CGAAACCACC ATCGTCAAAC AGTCCAAAAT 

1551 CAAAAAAGGC GGCAAAAACG GCGTGCTCAT CGACGGCGAA GACGGTCTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGACATTGTC 

1651 GGCTTCGTTA CCCGCGATCG CGGCATTTCG GTACACCGCA AAACCTGCCC 

17 01 CTCTTTCCGA CACCTCGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

17 51 GTTGGGCGGC GTTGCAGGAA GGACAAGTGT TCGCCGTCGA TATCGAAATC 

1801 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

1851 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAG C C AG CAT GAGGTTCACG CTCGAAGTCA AACAAGTTAC CGACCTCCCA 

1951 CGCGTCCTCG CCAGCCTCGG CGACGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 416; ORF 1 17. a>: 

all7 -pep 

1 MVHELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LAT PEERAQQ AETMR KMLLA MVTDIRWLI KLA MRTRTLQ 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQE PEK 

151 YREIALLLDE KRTERLEYIE NFLNILRTEL KKYNIHFEVA GRPKHIYSIY 

201 KKMVKKKLS F DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

2 51 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFKEFG VAAHWRYKEG 

301 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

351 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 
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MIMM ! I II I MM I II i I M I Ml I I Mill Mi I I ! I I I I I I I I I : M I I I I i 

FRHLAEHAPEKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 

460 470 480 490 

QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 

II M M M MIM M MM MMM Ml Ml MM Ml III 

QSRDLEASMRFTLEVKQVNDLPRVLAGLGDVKGVLSVTRLX 
630 640 650 660 
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401 EIITAKEGHP 

4 51 KQLAKLTPKP 

501 PPPVPVSETT 

551 GFVTRDRGIS 

601 RAQDRSGLLR 

651 RVLASLGDVK 



SVNWLYEGWV 
NLQELAENLG 
IVKQSKIKKG 
VHRKTCPSFR 
DVSDALARHK 
GVLSVTRL* 



KSNKAIGKIR 
YKKPEDLYTA 
GKNGVLIDGE 
HLAEHAPEKV 
LNVTAVQTQS 



AYIRQQNADT 
VGQGEISNRA 
DGLMTTLAKC 
LDASWAALQE 
RDLEASMRFT 



VREEGRVQLD 
IQKACGTLNE 
CKPAPPDDIV 
GQVFAVDIEI 
LEVKQVTDLP 



mll7/all7 98.0% identity in 490 aa overlap 

10 20 30 

ml 17 .pep VKLKKYN VH FE VAGR PKH I YS I YKKMVKKKL 

: I I i I I : I t I I I I I I I I I I I It I i I I I I I I 
a 117 EKYREIALLLDEKRTERLEYIENFLNILRTELKKYNIHFEVAGRPKHIYS I YKKMVKKKL 

150 160 170 180 190 200 

40 50 60 70 80 90 

m!17 .pep SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
I I I I I I I I! I I II M I I i I J I I I M I I I M I I I I I I I I I I I I I I I I I I I I I ! M I I M I I 
a!17 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGE FDDYIANPKGNGYKSLHTVIVG 

210 220 230 240 250 260 

100 110 120 130 140 150 

mll7 . pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
M | | I I I I I I I M I I I I I II I I I I I If I I I I I I I I I i I I I I I I I I I M I I I I I 1 II I I I I 
all7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 

160 170 180 190 200 210 

mll7 .pep KEDIAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
i I I II I I I I I I I i I I I I I I I I I M I I I I I M II I I I I I I i I I I M II I I 11 1 I I i I I I I I 

all7 KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 

220 230 240 250 260 270 

ml 17 .pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 

I I I I I I I I I I I I M I I I I I 1 M I I I M I I I M M II I M i M 1 I I M I I M I II i II I I I 
all7 PLS T PLENGQR VE 1 1 T AKEGH PS VNWL YEGWVKSNKAI GKI RAY I RQQN ADT VREEGRVQ 

390 400 410 420 430 440 

280 290 300 310 320 330 

mll7 .pep LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
I M I I I I I I I I I M I I I I I I I I I I I I I I I M I I M M I M I I M I I I I I I I I I M I I I If 
all7 LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
450 460 470 480 490 500 

340 350 360 370 380 390 

mll7 . pep TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 
I | | | I I I 1 I I I I I II I M I II I I If I II I II 11 II I I I I I I : I I I I I : I I I I II i I : I 
all7 TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPS 
510 520 530 540 550 560 

400 410 420 430 440 450 

ml 17 .pep FQHLAEHAPXKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
1:1111111 I II I f II I I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I if M I I I I I I I! 
all7 FRHLAEHAPEKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 

460 470 480 4 90 

ml 17 .pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 
I I I It I I I I I I I If I I I I : I I I I I II I I I I I I I I I I I I I I I 
all7 QSRDLEASMRFTLEVKQVTDLPRVLASLGDVKGVLSVTRLX 
630 640 650 660 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 41 7>: 

gll7-l . seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CCCTGCAAGA 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTCGCTGG CGCAGGAACA TTATCCTGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTCGACGAAC TCGACCTGCT GCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGATTGGAA CCTATTGGTT 

301 TCCGAGCGCT GCAACAGCAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTTACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

4 01 AAGAACGCGC ACAGCAAGCG GAAACCATGC GGAAAATGCT GCTGGCGATG 

4 51 GTTACCGACA TCCGCGTCGT ATTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCTGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG CTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCGATA TCCTGCGTAC GGAACTCAAA 

751 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CAT TTAC AA A AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTGTTCG 

851 ACATCCGCGC CGTGCGGATT CTGGTCGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGCA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGagttCGA 

951 cgactACATC GCCAACCCCA AAGgcaACGg ttATAAAAGt TTGCACACCG 

1001 TCATCGTcgg cccGGAagaa aaaggtgtgg aagtgcAAAT CCGCACCTTC 

1051 GATATGCacc AATTCaaCga ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAAGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAATATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCTCTGCC AACGGGCGCA ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGCTG CCGGGGCGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

14 01 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

14 51 TTTACGAAGG CTGGGTCAAA TCCGGCAAGG CCATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGTGTCCA 

1551 ACTCGACAAG CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTGC CCGTCAGCGC AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGTGGC AAAACCGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGCTTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 TATTGCCGGC TTCGTTACCC GCGAGCGCGG CATTTCCGTC CACCGCAAAA 

1901 CCTGCCCCTC TTTCCGACAC CTTGCCGAAC ACGCGCCCGA AAAAGTACTG 

1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGG CAAGTGTTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCG GCCTCGGCGA TGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 418; ORF 117-l.ng>: 

gll7-l .pep 

1 MTAISPIQDT QSATLQELRE WFDSYCAALP DNDKNLIGTA WSLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VDELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LAM RTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLDILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVI VGPEE KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTI YVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

4 51 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SGKAIGKIRA 
501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

5 51 GQGEISNRAI QKACGTLNEP PPVPVSATTI VKQSKIKKGG KTGVLIDGED 
601 GLMTTLAKCC KPAPPDDIAG FVTRERGISV HRKTCPSFRH LAEHAPEKVL 
651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 
701 DLEASMRFTL EVKQVNDLPR VLAGLGDVKG VLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 419>: 

mll7-l . seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 
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51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTTGCTGG CGCAGGAACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

4 01 AAGAACGCGC CCAGCAGGCA GAAACTATGC GGAAAATGCT GCTGGCGATG 

451 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAAAGCC 

651 CGAAAAATAC CGCGAAATCG CGCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCAACA TCCTGCGCGG TGAACTCAAG 

751 AAATACAATG TCCATTTCGA AGTCGCCGGC CGCCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTCTTTG 

851 ACATCCGCGC CGTGCGAATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGTA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAATCCCA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTACAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAGGGCGGC AAGGGCGATT CCGCCTACGA ACAGAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

12 51 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACGGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

14 51 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 TATTATCGGC TTCGTTACCC GCGAGCGCGG CATTTCAGTG CACCGCAAAA 

1901 CCTGCCCGTC TTTCCAACAC CTCGCCGAAC ACGCGCCCGA AAAAGTGCTG 

1951 GACGCAAGCT GGGCGGCATT GCAGGAAGGA CAAGTATTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 



This corresponds to 

mll7-l .pep 

1 MTAISPIQDT 
DAATPYGEPL 
SERCNSTVAE 
151 VTDIRVVLIK 
2 01 WQLKWQLEDL 
251 KYNVHFEVAG 
301 TLGIVHSLWQ 
351 DMHQFNEFGV 
4 01 LAAAFKTELF 
451 KVEGQIVPLS 
501 YIRQQNADTV 
551 GQGEISNRAI 
601 GLMTTLAKCC 
651 DASWAALQEG 
701 DLEASMRFTL 



the amino acid sequence <SEQ ID 420; ORF 1 17-1 > 



51 
101 



QSATLQELRE 
PDHFLGAAQM 
LVKGVDEVQK 
LAMRTRTLQF 
GFRHQKPEKY 
RPKHIYSIYK 
PIPGEFDDYI 
AAHWRYKEGG 
NDTIYVLTPH 
TPLENGQRVE 
REEGRVQLDK 
QKACGTLNEP 
KPAPPDDIIG 
QVFAVDIEIR 
EVKQVNDLPR 



WFDSYCAALP 
VHELDLLPDA 
LTHFARVDSL 
LSNAPDSPEK 
REIALLLDEK 
KMVKKKLSFD 
ANPKGNGYKS 
KGDSAYEQKI 
GKVLSLPTGA 
IITAKEGHPS 
QLAKLTPKPN 
PPVPVSETTI 
FVTRERGISV 
AQDRSGLLRD 
VLASLGDVKG 



DNDKNLIGTA 
VAATLLADIG 
ATPEERAQQA 
RAVAKETLDI 
RTERLEYIEN 
GLFDIRAVRI 
LHTVIVGPED 
AWLRQLLDWR 
TPIDFAYALH 
VNWLYEGWVK 
LQELAENLGY 
VKQSKIKKGG 
HRKTCPSFQH 
VSDALARHKL 
VLSVTRL* 



WLLAQEHYPA 
RYVPDWNLLV 
ETMR KMLLAM 
FAPLANRLGV 
FLNILRGELK 
LVDTVPECYT 
KGVEVQIRTF 
ENMAESGKED 
SSIGDRCRGA 
SNKAIGKIRA 
KKPEDLYTAV 
KNGVLIDGED 
LAEHAPEKVL 
NVTAVQTQSR 



mll7-l/gll7-l 98.2% identity in 737 aa overlap 



10 20 30 40 50 60 

ml 17-1. pep MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
I I I I I I I I I t I I I I I I I I I 1 I ! 1 I I I I I I I I I I I I M I I I 1 I M I I I I I I IS I I I I i 1 I 
g 1 1 7 - 1 MTAIS P I QDTQS ATLQELREWFDS YCAALPDNDKNL IGTAWSLAQEH YPADAAT PYGEPL 

10 20 30 40 50 '60 



70 



80 



90 100 110 120 
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ml 17-1 .pep PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
I ! I I I I I I I I I I I I I i I I I I I I t 1 I I I I I I I I I I I I I I I I i I I I ( I I I I I I | | | | | | j ! 
g 117-1 PDHFLGAAQMVDELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 17-1 . pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
I I i I I f I I I I I I ! I i i M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I i I 
g!17-l LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 1 7 - 1 . pep RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREI ALLLDEKRTERLEYI EN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I : I i I I I I I I I I I I 1 I I It I I I I I I I 
g 1 17 - 1 RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEKYREIALLLDEKRTERLEYIEN 

190 200 210 220 230 240 

250 260 270 280 290 300 

mll7-l.pep FLNILRGELKKYNVHFEVAGRPKHIYSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 
11:11! i I I I 1 I : I I I t ! I I I ! I i I I I I I i I I I M M I ! t I I I I I I I I I I I M I I I I M 
gll7-l FLDILRTELKKYNIHFEVAGRPKHIYSIYKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 1 7 - 1 . pep TLGI VHSLWQPI PGEFDDYIANPKGNGYKSLHTVI VGPEDKGVEVQIRTFDMHQFNEFGV 
I I I II I I I I I I M I I I I II I II I I I I I I I I I ! I I I I I I I : I I I I I I I | | | | | | | | | | | | | 
gll7-l TLGI VHSLWQPI PGEFDDY I ANPKGNGYKSLHT VI VGPEEKGVEVQIRTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 17-1 .pep AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDT I YVLTPH 
I I I I I I I 11 I I I ! I t I I I I I I I I I I I I I I I 1 I I I I i t I II I I I I I I I I I I I I II I I I I | I 
g 1 1 7 - 1 AAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 17-1 .pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I | | | | 
g 1 17 - 1 GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEI ITAKEGHPS 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 17-1 .pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I I I M 1 I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I | I I | I | | | | | | | | | | | | | | | M 
gll7-l VNWLYEGWVKSGKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 1 7 - 1 . pep KKPEDLYTAVGQGEI SNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLI DGED 
I I I I I N M I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I : I IN I I | | 
gll7-l KKPEDLYTAVGQGEI SNRAIQKACGTLNEPPPVPVSATTIVKQSKIKKGGKTGVLI DGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

m!17-l.pep GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I : I I I I I I I I I | | | | | | | | | | I I 
g 1 1 7 - 1 GLMTTLAKCCKPAP PDDI AGFVTRERG I S VHRKTC PS FRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

ml 17-1 . pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I I I I I | M || | | | 
gl 1 7 - 1 QVFAVDI E I RAQDRSGLLRDVS DALARHKLN VTAVQTQS RDLEASMRFTLEVKQVNDL PR 

670 680 690 700 710 720 

730 

ml 17-1 .pep VLASLGDVKGVLSVTRLX 
I I I : I I I I I I I I I I I II I 
gl 1 7 - 1 VLAGLGDVKGVLS VTRLX 

730 

mll7-l/RelA 
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spl P55133 |RELA_VIBSS GTP PYROPHOSPHOKINASE (ATP : GTP 3 ' -PYROPHOSPHOTRANSFERASE) (PPGPP 
SYNTHETASE I) >gi| 537617 (U13769) ppGpp synthetase I (Vibrio sp. ] Length = 744 
Score = 536 bits (1366), Expect = e-151 

Identities - 288/685 (42%), Positives = 432/685 (63%), Gaps = 31/685 (4%) 

Query; 74 LDLLPDAVAATLLADI GRYVPDWNLLVSERCNSTVAELVKGVDEVQKLTHFARVDSL 130 

L + D + A LL + GYD +E+T+ LV+GV+++ ++ ++ S 
Sbjct: 68 LSMDADTLIAALLYPLVEGGCYSTD ALKEEYSGTILHLVQGVEQMCAIS QLKST 121 

Query: 131 ATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEKRAVAKETLDI 190 

A +A Q + +R+MLL+MV D R V+IKLA R L+ + + PD +RA A+E +1 

SbjCt: 122 AEETAQAAQVDNIRRMLLSMVDDFRCVVIKLAERICNLREVKDQPDEV-RRAAAQECANI 180 

Query: 191 FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIENFLNILRGELK 250 

+APLANRLG+ QLKW++ED FR+Q P+ Y++IA L E+R +R +YI +F++ L +K 
Sbjct: 181 YAPLANRLGIGQLKWEIEDYAFRYQHPDTYKQIAKQLSERRIDREDYITHFVDDLSDAMK 240 

Query: 251 KYNVHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQ 310 

N+ EV GRPKHI YSI++KM KK L FD LFD+RAVRI+ + + +CY LG+VH+ ++ 
SbjCt: 241 ASNIRAEVQGRPKHI YSIWRKMQKKSLEFDELFDVRAVRIVAEELQDCYAALGVVHTKYR 300 

Query: 311 PIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEG- 369 

+P EFDDY+ANPK NGY+S+HTV++GPE K +E+QIRT MH+ +E GVAAHW+YKEG 
SbjCt: 301 HLPKEFDDYVANPKPNGYQSIHTVVLGPEGKTIEIQIRTKQMHEESELGVAAHWKYKEGT 360 

Query: 370 — GKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPHGKVLSLP 4 27 

G SAY++KI WLR+LL W+E M++SG ++ ++++F+D +Y TP G V+ LP 

SbjCt: 361 ASGGAQSAYDEKINWLRKLLAWQEEMSDSG — EMLDELRSQVFDDRVYAFTPKGDVVDLP 418 

Query: 428 TGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEI ITAKEGHPSVNWLYE- 486 

+ ATP+DFAY +HS +G RC GAKVEG+IVP + L+ G +VEIIT KE +PS +WL 
SbjCt: 419 SNATPLDFAYHIHSEVGHRCIGAKVEGRIVPFTYHLQMGDQVEIITQKEPNPSRDWLNPN 478 

Query: 487 -GWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKL — TPKPNLQELAENLGYKKP 54 3 

G+V S++A K+ A+ R+Q+ D G+ L+ +L K+ T K A+ K P 

SbjCt: 479 LGFVTSSRARAKVHAWFRKQDRDKNIIAGKEILEAELVKIHATLKDAQYYAAKRFNVKSP 538 

Query: 544 EDLYTAVGQGEIS-NRAIQKACGTLNEPPPVPVSETTIVKQSKI KKGGKNGV 594 

E+LY +G G++ N+ I +N+P + + K S+ KK ++ V 

SbjCt: 539 EELYAGIGSGDLRINQVINHINALVNKPTAEEEDQQLLEKLSEASNKQATSHKKPQRDAV 598 

Query: 595 LIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASW 654 

+++G D LMT LA+CC+P P DDI GFVT+ RGISVHR C + L HAPE+++D W 
Sbjct: 599 VVEGVDNLMTHLARCCQPI PGDDI QG FVTQGRGISVHRMDCEQLEELRHHAPERI I DTVW 658 

Query: 655 AALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQ— SRDLEASMRFTLEV 712 

G + + + + A +R+GLL+++++ L K+ V ++++ + + M F LE+ 
SbjCt: 659 GGGFVGN-YTITVRVTASERNGLLKELTNTLMNEKVKVAGMKSRVDYKKQMSIMDFELEL 717 



Query: 713 KQ VN DL PRVL A S LGD VKGVL S VT RL 737 

+ L RVL + VK V RL 
Sbjct: 718 TDLEVLGRVLKRI EQVKDVAEAKRL 742 

The following partial DNA sequence was 

all7-l . seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCAC 

101 AAAAACTTGT CTTAGCCGCC CGTTCGCTGG 

151 GATGCCGCCA CGCCGTATGG CGAACCGCTG 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT 

401 AAGAACGCGC CCAGCAGGCA GAAACTATGC 

451 GTTACCGACA TCCGCGTCGT GTTAATCAAA 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT 

701 GCCTCGAATA CATCGAAAAC TTCCTTAATA 

7 51 AAATACAATA TCCACTTTGA AGTCGCC GGC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT 



identified in N. meningitidis <SEQ ID 42 1>: 

CAAAGCGCGA CTCTGCAAGA 
CGCGCTGCCG AACAACGATA 
CGGAAGCACA TTACCCCGCC 
CCCGACCACT TCCTCGGCGC 
CCCCGATGCC GTCGCCGCCA 
CCGACTGGAA CCTATTGGTT 
CTGGTCAAAG GTGTGGACGA 
GGACAGCCTC GCCACGCCGG 
GGAAAATGCT GCTGGCGATG 
CTGGCGATGC GTACGCGCAC 
CCCCGAAAAA CGCGCCGTCG 
TCGCCAACCG TTTGGGCGTG 
GGCTTCCGCC ATCAAGAACC 
GGACGAAAAA CGCACCGAAC 
TCCTGCGTAC GGAACTCAAA 
CGTCCGAAAC ACATCTACTC 
CAGCTTCGAC GGGTTGTTCG 
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851 ACATCCGCGC CGTGCGGATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACACTGGGCA TTGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAACCCGA AAGGCAACGG C TAT AAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTGCAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCGCACT GGCGTTACAA 

1101 AGAGGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTAC 

1151 GCCAACTTTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACAGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

14 01 GCGTGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

14 51 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGC C AG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

1751 C C AAAATC A A AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 CATTGTCGGC TTCGTTACCC GCGATCGCGG CATTTCGGTA CACCGCAAAA 

1901 CCTGCCCCTC TTTCCGACAC CTCGCCGAAC ACGCGCCCGA AAAAGT ACTG 

1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGA CAAGTGTTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTTACCGA 

2151 CCTCCCACGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 422; ORF 1 17-l.a>: 

all7-l .pep 

1 MTAISPIQDT QSATLQELRE WFDSYCTALP NNDKKLVLAA RSLAEAHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LAMRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLNILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPED KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

451 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SNKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKIKKGG KNGVLIDGED 

601 GLMTTLAKCC KPAPPDDIVG FVTRDRGISV HRKTCPSFRH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVTDLPR VLASLGDVKG VLSVTRL* 

all7-l/mll7-l 97.7% identity in 737 aa overlap 

10 20 30 40 50 60 

ml 17-1 . pep MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
I I I I I I I I I I I II I I I I I I I I I I I I I : I I I : I I I : I : : I j I : M I I I I I I I I I I t I 
a 1 1 7 - 1 MTAI S P I QDTQS ATLQELREWFDS YCTALPNNDKKLVLAARSLAEAH YPADAATP YGEPL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 17-1 . pep PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
I I 1 I I I I M I I I I t f I I 1 I I I I i I I I I ! I I I I I II I 1 I I i I I I I I | | | | | | | I | | | | | | f 
a 1 1 7 - 1 PDHFLGAAQMVHELDLLPDAVAATLLADI GRYVPDWNLLVSERCNSTVAELVKGVDEVQK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 17-1 .pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
I I I I I I I I M I I I M I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II I | | I | | | | | | | | I | 
a 1 1 7 - 1 LTHFARVDSLATPEERAQQAETMRKMLLAMVTDI RVVLI KLAMRTRTLQFLSNAPDS PEK 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 1 7 - 1 . pep RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQK PEK YREI ALLLDEKRTERLEYIEN 
I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I | I I | | | | | I M 
all7-l RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEK YREI ALLLDEKRTERLEYIEN 

190 200 210 220 230 240 
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250 260 270 280 290 300 

mll7-l .pep FLNILRGELKKYNVHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 
MM!) I I I I j !: I I I II i I I I I I I I I I I I I I I I I I M ! I I I I I I I I I I I M I I I I I I I 
al!7-l FLNILRTELKKYNIHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

mll7-l .pep TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 
I I I I I I I I I I I I I I I I I I I I I t I I I I! I I I I I i I I I I I I I I I I I II II I I I I I I I I I I I I 
all7-l TLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

mll7-l .pep AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 
I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I t I I I I I t I I I I I I I I I I I I I I I 
all7-l AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 
370 380 390 400 410 420 

430 440 450 460 470 480 

mll7-l .pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I I I I 1 I I I I I I I I I I I I I t I I II I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I i I I I 
a 1 1 7 - 1 GKVLSL PTGATPI DFAYALHSS IGDRCRGAKVEGQI VPLSTPLENGQRVEI I TAKEGHPS 

430 440 450 4 60 470 480 

490 500 510 520 530 540 

ml 17-1 .pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I I I 1 I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I M I I I I I I I 
all7-l VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 17-1 .pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 
I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
all7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 17-1 .pep GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I I I I I I I I I I I I I I I I I : 1 I I I I : f I I II I I II I I I I : I I I I I I I I I f I I I I I 1 I I I I I 
a 1 17 - 1 GLMTTLAKCCKPAPPDDIVGFVTRDRGI S VHRKTCPS FRHLAEHAPEKVL DAS WAAL QEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

ml 17-1 .pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
I I I I I I I I I I I I M I I f I I I I I I I I I I I 1 I I I I I I I I I I I I 1 I I I I I I I I I I 11 I : I I I I 
all7-l QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVTDLPR 

670 680 690 700 710 720 

730 

ml 1 7 - 1 . pep VLASLGDVKGVLS VTRLX 
M I I I I I I I I I I I I I I I I 
a 1 1 7 - 1 VLAS LG DVKGVLS VTRLX 

730 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 423>: 

gll8 . seq 

1 ATGTGCGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGCTG GAGAATGATT TAATcgaGGT TAGGAGAAAA 

151 T AT CCTT AT C C G AT G GAT AT ACCAAGGGAT ATTGTGATTG G7AATCGGTAC 

2 01 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

2 51 CTTCCCCTTG GTTGCCTGAT AGCGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAaCA AAA7AATAG 

This corresponds to the amino acid sequence <SEQ ID 424; ORF 1 18.ng>: 

gll8 . pep 

1 MCEFKDFRRN I PCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRRK 

51 YPYPMDIPRD IVIGIGTIID FLMVPNW ELF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 425>: 

mll8 . seq 

1 ATGTGTGAGT TCAAGGATAT TATAAGAAAC GTTCCTTATT TTGAGGGGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGTTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGACAT ACCAAGATAT GTTGTCATTG GAATCGGTAC 

201 CATTATTGAT TTCTTAATGG TTCCAAATTG GAAACTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 426; ORF 1 18>: 

mll8 . pep 

1 MCEFKDIIRN VPYFEGYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRY VVIGIGTIID FLMVPNW KLF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 118 shows 92.8% identity over a 125 aa overlap with a predicted ORF (ORF 1 18.ng) 
from N. gonorrhoeae: 

mll8/gll8 

10 20 30 40 50 60 

ml 18 . pep MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRY 

t I I I II : I } : i II I I I II M I I I I I I I I I I I I M I i I I I I M I M : I I I i I I ! I I I 
gll8 MCEFKDFRRNIPCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRRKYPYPMDIPRD 

10 20 30 40 50 60 

70 80 90 100 110 120. 

ml 18 .pep VVIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 
: M M t I I I t I It I I I I : M ! I I I 11 I I I I I I I I I M I I I I I i I M I ! I I I I i I I i I I I I 
gll8 IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 

70 80 90 100 110 120 



ml 18. pep YYNKKX 
I \ I t I I 

gll8 YYNKKX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 427>: 
all8 . seq 

1 ATGTGTGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAATTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGATAT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT G AAAT T AAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 428; ORF 1 18.a>: 

all8 .pep 

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRD IVIGIGTIID FLMVPNW ELF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 



ml 18 /all 8 93.6% identity in 125 aa overlap 

10 20 30 40 50 60 

ml 18. pep MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDIPRY 

I I I I I I : 11:1 II I I I M M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
all 8 MCEFKDFRRN I PCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKK YPYPMDIPRD 

10 . 20 30 40 50 60 
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70 80 90 100 110 120 

mll8.pep VVIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 
: I I I I I I I I I I I I I I I I : I M I I I I I I I M I M I I I I I I I I i I I I M I M I I I II I I I I I 
all 8 IVIGIGTIIDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 

70 80 90 100 110 120 



ml 18. pep YYNKKX 
I I I I M 

all 8 YYNKKX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 429>: 

gl20 . seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAAGGCTACC CCAATCCGCC GTGCTGCACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACAT TTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAATCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTGCCTACT 

251 ATAAAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

3 01 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

3 51 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

4 01 CGAAACTCCC CCCGGGTCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 
451 GTCGGCGGCC TGAATAAGGC GGGTACGGGA AAATACAGCA Taggcggcgt 

5 01 gGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATACGGTAA 
551 CGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 
601 ACCGAcgaCG GCAAAACCTA TACGCTGAAG CTCAAATCGG TGCAGATCAA 
651 CGGACAGGCC GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 430; ORF 120.ng>: 

gl20 .pep 

1 MMKTFKNIFS AAILSAALPC AYA ARLPQSA VLHYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PAYYKDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DTVTYFFAPS LNNIPAQIGY 

2 01 TDDGKTYTLK LKSVQINGQA AKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 43 1>: 

ml20 . seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGmACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCcAA ATTCGCCGAC 

3 01 GGCAGCGTAA CTTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

3 51 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 
401 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

4 51 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT 
501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 
551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

6 01 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 
651 CGGCCAGGCA GCCAAACCG 

This corresponds to the amino acid sequence <SEQ ID 432; ORF 120>: 

ml20 .pep 

1 MMKTFKNIFS AAILSAALPC AYA AGLPQSA VLXYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DAVMYFFAPS LNNIPAQIGY 

2 01 TDDGKTYTLK LKSVQINGQA AKP 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 120 shows 97.3% identity over a 223 aa overlap with a predicted ORF (ORF 120.ng) 
from N. gonorrhoeae: 

ml20/gl20 

10 20 30 40 50 60 

ml 2 0 . pep MMKTFKNI FS AAI LSAALPCAYAAGLPQSAVLXYSGS YGI PATMTFERSGNAYKI VSTI K 

IIIIIIIIIIIIIIIIMIIIIII lllllll IIIIMIIIIIIIIIIIIIIIIIIMI 

gl2 0 MMKTFKNI FS AAI LSAALPCAYAARLPQS AVLHYSGS YGI PATMTFERSGNAYKI VSTI K 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml20 . pep VPLYNIRFESGGTWGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

I : M il Ml I I I I II II I Ml:' hlllMII II; II llllll II I I I I I! M MINI I 

gl20 VPLYNIRFESGGTWGNTLHPAYYKDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml20 . pep DLFTLAWQLAANDAKLPPGLKI TNGKKLYSVGGLNKAGTGKYS I GGVETE WKYRVRRGD 

I III II III MINIMI llllllllll II II I II II MINI lllllll II I I MINI 
g 1 2 0 DLFTLAWQLAANDAKLP PGLKI TNGKKLYSVGGLNKAGTGKYS I GGVETE WKYRVRRGD 

130 140 150 160 170 180 

190 200 210 220 

ml20 . pep DAVMYFFAPS LNNI PAQ IGYTDDGKTYTLKLKS VQ INGQAAKP 

hi h III llllll II I IIIIMIhllMI II: llllhh 

gl20 DTVTYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

190 200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 433>: 

al20 . seq 

^ 1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGCACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC G AC GAT T AAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

301 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

4 01 CGAAACTCCC CCCGGGGCTG AAAAT C AC C A ACGGCAAAAA ACTTTATTCC 

4 51 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAAT AC AG C A TAGGCGGCGT 

501 GGAAACCGAA GTCGT CAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 

551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 

651 CGGCCAGGCA GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 434; ORF 120.a>: 
al20 .pep 

1 MMKTFKNI FS AAILSAALPC AYA AGLPQSA VLHYSGSYGI PATMT FERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYS I GGVETE WKYRVRRGD DAVMYFFAPS LNNI PAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP* 

ml20/al20 99.6% identity in 223 aa overlap 

10 20 30 40 50 60 

ml20 . pep MMKT FKN I FS AA I LSAALPCAYAAGLPQSAVLXYSGS YGI PATMTFERSGNAYKI VST IK 

I I I It I I M E I I I I It I I I I I I I I I I I I I I j I I I I I I I I I I I I I I I I I I I I I I I I I | I I 
al2 0 MMKT FKNIFSAAILSAALPCAYAAGLPQSAVLHYSGSYGIPATMT FERSGNAYKIVSTIK 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ml20 . pep VPLYNIRFESGGTVVGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 
I I I I M I I I I I I I I I I I I I M M I I I I I I If I I II I I I I I I I I I i M II I I I I I I I I M I 
al20 VPLYNIRFESGGTVVGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml20 . pep DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNPCAGTGKYSIGGVETEVVKYRVRRGD 
I I I I M I I I I I I I I I I 1 I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I 
a 120 DLFTLAWQLAANDAKLPPGLKITKGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD 

130 140 150 160 170 180 

190 200 210 220 

ml20.pep DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I M I I 
a 120 DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 435>: 

0121 . seq 

1 ATGGAAACAC 

51 GGCGGATGCC 

101 AAGGGCACGC 

151 GATTTGCAGG 

2 01 GCAAGAACTC 

251 GTCAAAACCT 

301 ACCGTCCGAC 

351 GCCGCTGCTG 

4 01 GCCGCGACCT 

4 51 CACGAAGCCC 

501 CGGCGGGATT 

551 GCTTCGACAC 

601 cacTGGcagc 

651 catatTGCcg 

701 AACCCcaccc 

751 gaaacctAcc 

801 ttcccgattc 

851 CAGATGCCCG 

901 TTAATGGCGG 

951 CACCGCCGAA 

1001 cgtggttggC 

1051 GCGACCGGCG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 436; ORF 121.ng>: 

gl21 .pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRVVLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

2 01 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

2 51 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 43 7>: 

ml 21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 



AGCTTTACAT CGGCATTATG 
GTGCTGGTAC GGATGGACGG 
CTTTACCCCC TACCCTGACC 
ACACAGGCAC AGACGAACTG 
AGCCGCCTGT ACGCGCAAAC 
CGCTCCGTGC GACATTACCG 
ACGCGCCGGA ACACGGTtac 
GCGGAACTGa cgcggatttT 
TGCTGCCGGC GGacaAGGTG 
TGTTCCGCGA TGACAGGGAA 
GCCAACATCA GCGTACTCCC 
AGGGCCGGGC AATATGCTGA 
TGCCTTACGA CAAAAacggt 
cAACTGCTCG gcaggctGCT 
aaAAAGCACG GGgcGCGaac 
ttgacggcgg cgaaaaccga 
accgcgcaaA ccgTttggga 
TCAAATGTAC ATTTGCGGCG 
ATTTGGCAGA ATGTTTCGGC 
CTGAACCTCG ATCCTCAATG 
GGCGTGTTGG ATTAACCGCA 
CATCCAAACC GTGTATTCTG 



TCGGGAACCA GTATGGACGG 
CGGCAAATGG CTGGGCGCGG 
GGTTGCGCCG CAAATTGCTG 
CACCGCAGCA GGATGTTGTC 
CGCCGCCGAA CTGCTGTGCA 
CCCTCGGCTG CCACGGGCAA 
AGCATACAGC TTGCCGATTT 
TACCGTCggc gacttcCGCA 
CGCCGCTCGT CCCCGCCTTT 
ACACGCGTGG TACTGAACAT 
CCCCGGCGCA CCCGCCTTCG 
TGGAcgcgtg gacgcaggca 
gcAAAGgcgg cacAAGGCAA 
CGCCcaccCG TATTTCTCAC 
TgtttgcccT AAattggctc 
tacgacgtat tgcggacgct 
cgccgtctca CACGCAGCGG 
GCGGCATCCG CAATCCTGTT 
ACACGCGTTT CCCTGCACAG 
GGTGGAGGCG gccgCATTtg 
TTCCCGGTAG T C CGC AC AAA 
GGCGCGGGAT AT TAT TAT T G 
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151 


GATTTGCAGG 


201 


GCAAGAACTC 


251 


GTCAAAACCT 


301 


ACCGTCCGAC 


351 


GCCGCTGCTG 


401 


xxxxxxxxxx 


451 


xxxxxxxxxx 


501 


xxxxxxxxxx 


551 


xxxxxxxxxx 


601 


xxxxxxCAGC 


651 


CATATTGCCG 


701 


AACGCCACCC 


751 


GAAACCTACC 


801 


TTCCCGTTTT 


851 


CAGATGCCCG 


901 


TTAATGGCGG 


951 


CACCGCCGAC 


1001 


CGTGGTTGGC 


1051 


GCAACCGGCG 


1101 


A 



ACACAGGCGC 
AGCCGCCTAT 
CGCACCGTCC 
ACGCGCCGGA 
GCGxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
TTCCTTACGA 
CAACTGCTCG 
TAAAAGCACG 
TTGACGGCGG 
ACCGCGCAAA 
TCAAATGTAC 
ATTTGGCAGA 
CTGAACCTCG 
GGCGTGTTGG 
CATCCAAACC 



AGACGAACTG 
ATGCGCAAAC 
G AC AT T AC CG 
ACACGGTTAC 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
CAAAAACGGT 
ACAGGCTGCT 
GGGCGCGAAC 
CGAAAACCGA 
CCGTTTGCGA 
ATTTGCGACG 
ATGTTTCGGC 
ATCCGCAATG 
ATTAATCGCA 
GTGTATTCTG 



CACCGCAGCA 
CGCCGCCGAA 
CCCTCGGCTG 
AGCATACAGC 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
GCAAAGTCGG 
CGCCCACCCG 
TGTTTGCCAT 
TACGACGTAT 
CGCCGTCTCA 
GCGGCATCCG 
ACACGCGTTT 
GGTGGAAGCC 
TTCCCGGTAG 
AnCGCGGGAT 



GGATTTTGTC 
CTGCTGTGCA 
CCACGGGCAA 
TTGCCGATTT 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
CACAAGGCAA 
TATTTCGCAC 
AAATTGGCTC 
TGCGGACGCT 
CACGCAGCGG 
CAATCCTGTT 
CCCTGCACAG 
GCCGnATTTG 
TCCGCACAAA 
ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 438; ORF 121>: 

ml21 -pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

Computer analysis of the amino acid sequences gave the following results: 

Homology with a predicted ORF from meningitidis menA with menB 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF121.ng) 

from TV. gonorrhoeae: 

ml21/gl21 

10 20 30 40 50 60 

ml 21 .pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
M I I I I I I II I I I I I II 1 1 I I I : I I I I I I 1 I I I M I I I I 1 I I I I I I : I I II I I II : I I I 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFT PYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I | | I : | I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I It I II I I I I I I I I 
gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
g 121 AELTRI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 

: : I I I I I II I I I : I I I I I I I I I I I I I I II I I : I Mill 

gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I If I I I I I I I I I I I I I I I I I I I I I I I I I I 
g!21 GRE LFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVS HAAADARQMY I CGGG I RNPV 

250 260 270 280 290 300 
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310 320 330 340 350 360 

ml 21 .pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAX FAWLAACWINRI PGSPHKATGASKPCIL 

I M | I ) I I I I ! I 1 II I I I I : I I I I I M ! I I I I I I II ! I I I I I I I I I I II I I I I I I I I I I 
g 121 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRI PGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 439>: 

al21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

3 51 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 
4 51 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 
501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 
551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 
601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 
651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 
751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 
8 01 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 
851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATT ATT ATT G 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 440; ORF 121.a>: 

al21 .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAE GRAFT P YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAP LVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

2 01 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

2 51 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21/al21 74.0% identity in 366 aa overlap 

10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
1 I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II f I I (I M I I I I I : I 1 11 I I I I I I I I 
a 121 METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I : I I : I I I I I M I 
al21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml 21 .pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
a 121 AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

- ml 21 -pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 

: I M I I M I I I : I I ] I I t It I I I I I I M I II II I I I I I 

a 121 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21 .pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I M : I M I I I M I I II I I I I I I I I I I I I I I I I I I I M I I I ! I I II I II I I I II M 
a 121 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21 . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
i I I I I I I I I M I I I I I I I I : I M I I I I I I I I I I I : I I I I : I I I I I I I E I M II I I M II 
al21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
MIME 

al21 GAGYYYX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 441>: 

m.121-1 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT AT TAT T ATT G 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 442; ORF 121-1>: 

ml21-l .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRXFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAEC FG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

ml21-l/gl21 95.6% identity in 366 aa overlap 

10 20 ' 30 40 50 60 
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ml21 -1 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

I | | | I | I I I I ! I I I I I I I I I I I : I I I t I I I I I I I I I ! 1 I I I I I I I I : I M I I 1 I I : I I 1 
g 121 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21-l .pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
! I } I : I I 1 I I I I I I I ) I ) I ) 1 i I ) I I ) I I I I I I I I I I I I I I I I I I 1 I I I 1 I I I I t I I I i 
g 121 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l .pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 

II I I I I I I I I I I i i I I I ( I I I I II I I I f 1 I f I M i I :( I I I : I I I M I I [ I I I I M I I 
g 121 AELTRI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRVVLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l . pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKKGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
I ! I I | M ! I I 1 I I I I I I I 1 I I I I 1 I I I t I 1 I I I I I I I I I I I I I I I I I I I I I : I I I I I II 
g 1 2 1 PA FG FDTG PGNMLMDAWTQAHWQLP YDKNGAKAAQGN I L PQLhGRLh AH P Y FS QPH PKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l - pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
I M | | | I | I I 1 I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II 
gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I | | J | | ! I M II I I I I I t I : I I I I ! I I I I I i I ] I I I I 1 I I I I I I I I I I I I I I I II 1 I I I 
g 121 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml 2 1 - 1 . pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 443>: 

al21-l. seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

7 51 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 444; ORF 121-1 ,a>: 

al21-l .pep 

X METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 
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201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21-l/al21-l 96.4% identity in 366 aa overlap 

10 20 30 40 50 60 

m!21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I t I I I I t I I I I I I I I 1 I I I I I I I M I 1 I I I I I I I I 1 I I 1 I I I I I I • I I I I I I 1 I I I I I 
al2 1- 1 METQLYIG1MSGTSMDGADAVLI RMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 2 1 - 1 . pep HRSRI LSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYS I QLADLPLL 
} M I : I I I I II I ) il 1 II I I I I I J I It I I I i I I I I I t I I I I I I I I I I I : I I : I I I I ! I I I 
al2 1- 1 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 - 1 . pep AERTRI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNI GGI ANI SVLPPDA 
I | { | : | I I i I I t I I I I I i I I I I I I I I II I I I i I I I I f : II I I I f I I I I II I i I I I I I I I I 
al21-l AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIAN I SVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l . pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
1 I I t I I I I It I I I II I I I I I I I I I I I I i I 1 I I 1 I I I I I I I I i I I I I I I I I I I I 1 I I I I t 
al21-l PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l . pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
I I I [ 1 I I 1 I I I i I M I I I I I I I I I I I 1 I I II M I I I E I I I I I I I I I 1 I I I I I I I 1 I I I I 
a 12 1-1 GRELFALNWLETYLDGGENR YD VLRTLSRFTAQTVFDAVSHAAADARQMY ICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 2 1-1 . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
| I | | | | I | | I I I I i I I I I I : I I I I I I 1 I 1 I I I I I : I I I I : I I I I I I I I I I I I I I I I I I I 
a 1 2 1 LMADLAECFGTRVSLHS TAELNLDPQWVEAAAFAWMAACWVNRI PGS PHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 445>: 

gl22 - seq 

1 ATGGCTTTAC TGAG CATCCG CAAGCTGCAC AAACAATACG GCAGCGTAAC 

51 CGCCATCCAA TCCTTAGACT TGGACTTGGA AAAAGGCGAA GtcatCGTAC 

101 TGCTGGGCCC gTccggctgc ggCAAATCCA CCCTcctgcg ctgcgtcaaC 

151 GGTTTGGAGC CGCACCAagg eg GCAGCATC GTGATGGACG GTgtcgGCGA 

201 ATTCggcAAA GACGTTTCCT GGCAAACCGC CCGGCAAAAa gtcggtatgg 

251 tctttcaaag taacgAactg Tttgcccaca tgacegtcat cgAaaacatc 

3 01 ttcttAggcC CGGTAAagga aCAAAAcCgc gaccgtgccg aagcaGAGGC 

351 gCAAGCCGGC AAactGttgg aacgcgTCGG actgctAGAC CGCAAAAACG 

401 CCTATCCGCG CGAACTTTCC GGCGGTCAGA AACAG CGCAT CGCCATTGTC 

451 CGCGCCCTGT GCCTGAATCC GGAAGTCATC CTGCTGGACG AAATCACCGC 

501 CGCACTTGAC CCCGAAATGG TGCGCGAAGT CTTGGAAGTG GTTTTGGAAC 

551 TCGCCCGCGA AGGGATGAGT ATGCTCATCG TAACCCACGA AATGGGGTTC 

6 01 GCACGCAAAG TTGCCGACCG CATCGTCTTT ATGGACAAAG GCGGCATCGT 

651 CGAATCGTCC GACCCCGAAA CCTTTTTTTC CGCACCAAAA AGCGAACGCG 

701 CCCGCCAATT TCTGGCAGGT ATGGACTACT GA 

This corresponds to the amino acid sequence <SEQ ID 446; ORF 122.ng>: 
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gl22 -pep 

1 MALLSIRKLH KQYGSVTAIQ SLDLDLEKGE VIVLLGPSGC GKSTLLRCVN 

51 GLEPHQGGSI VMDGVGE FGK DVSWQTARQK VGMVFQSNEL FAHMTVIENI 

101 FLGPVKEQNR DRAEAEAQAG V KLLERVGLLD RKNAYPRELS GGQKQRIAIV 

151 RALCLNPEVI LLDEITAALD PEMVREVLEV VLELAREGMS MLIVTHEMGF 

2 01 ARKVADRIVF MDKGGIVESS DPETFFSAPK SERARQFLAG MDY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 447>: 

ml22 . seq 

1 GTTGTCATGA TTAAAATCCG 

51 TATTTTGCGC GGCATCGATT 

101 TCCTCGGGcC TTCCGGCTCA 

151 GCGTTGGAAA TGCCCGAAGA 

201 GCTGAAAATC GATTTTTCTA 

251 TGCGCCGCAA ATCAkGCATG 

3 01 AAAACCGCCT TGGAAAACGT 

3 51 GCCTGCCGCC CAAGCGCGCG 

4 01 GCTTGGGCGA CAAAGTGGAT 
4 51 CAGCAGCGCG TCGGCATTGC 
501 GCTGTTTGAC GAACCGACTT 
551 TTTTGGATmC CATGAAGGAA 
601 GTTACGCATG AAATCAAGTT 
651 GATGGACrGC GGCGTTATTG 
701 ACCACCCCAA ACACGAACGG 
751 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 448; ORF 122>: 

ml 2 2 . pep 

1 WMIKIRNIH KTFGENTILR GIDLDVCKGQ VWILGPSGS GKTTFLRCLN 
51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSXM VFQQYNLFPH 
101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ 
151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLDXMKE LAQEGWTMW 
201 VTHEIKFALE VATTXWMDX GVIVEQGSPQ DLFDHPKHER TRRFLSQIQS 
251 TKI* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 122 shows 47.2% identity over a 246 aa overlap with a predicted ORF (ORF 122.ng) 
from N. gonorrhoeae: 

ml22/gl22 

10 20 30 40 50 60 

ml22 .pep 
9122 



ml22 .pep 
g!22 



ml22 . pep 
g!22 



ml22 .pep 
gl22 



CAATATCCAT 
TGGATGTGTG 
GGCAAAACGA 
CGGACAAATC 
AAAAACCAAG 
GTGTTTCAAC 
AATGGAAGGA 
AAGAGGCTCT 
TTGTATCCCT 
CCGCGCATTG 
CCGCGCTCGA 
TTGGCG CAAG 
CGCCTTAGAA 
TCGAACAAGG 
ACG CGGAGAT 



AAG AC CTTTG 
CAAAGGGCAG 
CGTTTCTGCG 
GAGTTCGACA 
CAAACACGAT 
AATACAAyCT 
CCGGTTGCCG 
GAAAC TGCTG 
ACCAGCTTTC 
GCGATTCAGC 
TCCTGAATTG 
AAGGCTGGAC 
GTGGCAACCA 
CAGCCCGCAA 
TTTTAAGCCA 



GCGAAAACAC 
GTGGTCGTCA 
ATGCCTAAAC 
ACGAGCGACC 
ATTTTGGCAC 
CTTTCCGCAC 
TACAgGGCAA 
GAAAAAGTCG 
CGGCGGTCAG 
CTGAACTGAT 
GTGCAAGATG 
CATGGTTGTC 
CCGwCGTCGT 
GATTTGT TCG 
AATCCAATCT 



WMI KI RNI HKTFGENTI LRGI DLDVCKGQVVVI LG PS GS GKTTFLRCLN ALEMPEDGQ I 
= ::::||::|| =| I = = : : I I I = I I = I = I = I I I M I I = | : || | : | : || : |:| 
MALLS I RKLHKQYGSVTAIQSLDLDLEKGEVI VLLGPSGCGKSTLLRCVNGLEPHQGGS I 
10 20 30 40 50 60 

70 80 90 100 110 120 

EFDNERPLK I DFSKKPSKHD I LALRRKSXMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 

:|= : II- hi II I i = : I I I h : I I = = I I I I- I 

VMDGVGEFGKDVSWQTA RQKVGMVFQSNELFAHMTVI ENI FLG P VKEQNRDRA 

70 80 90 100 110 

130 140 150 160 170 180 

QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 
:h =1 Nihil I h = II : I I I I h I h : h I I I - II- hi I hi II I h 
EAEAQAGKLLERVGLLDRKNAYPRELSGGQKQRIAIVRALCLNPEVILLDEITAALDPEM 
120 130 140 150 160 170 

190 200 210 220 230 240 

VQDVLDXMKEIAQEGWTMVWTHEIKFALEVATTXVVMDXGVIVEQGSPQDLFDHPKHER 
h = lh : I I hi I :h:|||h II :|| | || | ||h-h - I = II || 

VREVLEWLELAREGMSMLIVTHEMGFARKVADRIVFMDKGGIVESSDPETFFSAPKSER 
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180 190 200 210 220 230 



ml22 . pep 
gl22 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 449>: 

al22 . seq 

1 GTTGTCATGA TTAAAATCCG 

51 CATTTTGCGC GGCATCAATT 

101 TCCTCGGGCC TTCCGGCTCA 

151 GCGTTGGAAA TGCCCGAAGA 

201 GCTGAAAATC GATTTTTCTA 

251 TGCGCCGCAA ATCAGGCATG 

301 AAAACCGCCT TGGAAAACGT 

3 51 GCCTGCCGCC CAAGCGCGCG 

4 01 GCTTGGGCGA CAAAGTGGAT 
4 51 CAGCAGCGCG TCGGCATTGC 
501 GTTGTTTGAC GAACCCACTT 
551 TGTTGAACGC CATGAAGGAA 
601 GTTACCCACG AAATCAAGTT 
651 GATGGACGGC GGCGTTATCG 
701 ACCACCCCAA ACACGAACGG 
7 51 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 450; ORF 122.a>: 

al22.pep 

1 WMIKIRNIH KTFGKNTILR GINLDVCKGQ VVVILGPSGS GKTTFLRCLN 

51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSGM VFQQYNLFPH 

101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ 

151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLNAMKE LAREGWTMVV 

2 01 VTHEIK FALE VATTWVMDG GVIV EQGSPK ELFDHPKHER TRRFLSQIQS 

251 TKI* 

m!22/al22 96.0% identity in 253 aa overlap 

10 20 30 40 50 60 

ml22 . pep VVMIKIRNIHKTFGENTILRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQI 
I I I I I I I I I M I I I : M i I I I I : I I I I I I I I I I I II I I I I I I If I I I! ! I I I I I I I I I M 
al22 VVMIKIRNIHKTFGKNTILRGINLDVCKGQVVVILGPSGSGKTTF-LRCLNALEMPEDGQI 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml22 . pep EFDNERPLKI DFSKKPSKHDILALRRKSXMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 

I I II II I I I I I I I I M M I I I I I I I I I I II I I I I I I I I 1 I I I I I I I II I I I i I I I I | M 
al22 EFDNERPLKI DFSKKPSKHD I LALRRKSGMVFQQYNLFPHKTALENVMEG PVAVQGKPAA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 2 . pep QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 
I I I I I I I! I I I I I I I I I I I II I I I I I I I M I I I I I M I I I I I t I I M I I I I I I I I I I I M 
al22 QAREEALKLLE KVGLG DKVDLY P YQLSGGQQQRVG I ARAL AI Q PE LML FDE PT S ALD PE L 

130 140 150 160 170 180 

190 200 210 220 230 24 0 

ml22 . pep VQDVLDXMKELAQEGWTMVVVTHEIKFALEVATTXWMDXGVIVEQGSPQDLFDHPKHER 
Mill: I I I I I : I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I :: I I I I M I I I 
a 1 2 2 VQDVLNAMKE LAREGWTMV VVTHE I KFALEVATT VWMDGGV I VEQGS PKEL FDH PKHER 

190 200 210 220 230 240 



250 

TRRFLSQIQSTKIX 
= 1 = 11 = 

ARQFLAGMDYX 
240 



CAATATCCAT 
TGGATGTGTG 
GGCAAAACGA 
CGGACAAATC 
AAAAACCAAG 
GTGTTTCAAC 
GATGGAAGGA 
AAGAGGCTCT 
TTGTATCCCT 
CCGAGCATTG 
CCGCGCTTGA 
TTGGCGCGGG 
CGCGCTGGAA 
TAGAGCAGGG 
ACGCGGAGAT 



AAGACCTTCG 
CAAAGGGCAG 
CGTTTCTGCG 
GAGTTCGACA 
CAAACACGAT 
AATACAACCT 
CCGGTTGCCG 
GAAACTGCTG 
ACCAGCTTTC 
GCGATTCAGC 
CCCCGAGTTG 
AAGGTTGGAC 
GTTGCCACGA 
CAGCCCGAAA 
TTTTAAGCCA 



G C AAAAAT AC 
GTGGTCGTCA 
ATGCCTAAAC 
ACGAGCGACC 
ATTTTGGCAC 
CTTTCCGCAC 
TACAGGGCAA 
GAAAAAGTCG 
CGGCGGTCAG 
CCGAGCTGAT 
GTGCAAGACG 
GATGGTCGTC 
CCGTTGTCGT 
GAGTTGTTCG 
AATCCAATCT 



ml22.pep 



250 

TRRFLSQIQSTKIX 
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al22 



I I M I I I I I II I I I 
TRRFLSQIQSTKIX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 45 1>: 

gl22-l . seq 

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACCATTTT 

51 GCGCGGCATC GATTTGGATG TGGGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCGGGTAAA ACAACATTTC TGCGCTGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGCGC GGCCGTTACG 

201 CATTGATTTT TCCAAAAAAA CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAGTCCGG AATGGTATTC CAACAATACA ACCTCTTCCC GCATAAAACC 

301 GTGTTGGAAA ACGTGATGGA AGGGCCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTTTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGATAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

4 51 CGTGTCGGTA TCGCCCGCGC ACTGGCGATT CAGCCTGAAT TGATGCTGTT 

501 TGACGAACCC ACTTCCGCGC TGGACCCCGA GTTGGTGCAA GACGTGTTGG 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCACGCT GGAAGTTGCC ACGAACGTCG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

701 TCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTGCCAAG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 452; ORF 122-1. ng>: 

gl22-l.pep 

1 MIKIRNIHKT FGENTILRGI DLDVGKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNARPLRIDF SKKTSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 VLENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDAMKELA REGWTMVWT 

201 HEIKFTLEVA TNVVVMDGGV IVEQGSPKEL FDHLKHERTR RFLSQIQSAK 

251 I* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 453>: 

ml22-l . seq 

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACTATTTT 

51 GCGCGGCATC GATTTGGATG TGTGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCAGGCAAA ACGACGTTTC TGCGATGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGAGC GACCGCTGAA 

201 AATCGATTTT TCTAAAAAAC CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAATCAGG CATGGTGTTT CAACAATACA ACCTCTTTCC GCACAAAACC 

301 GCCTTGGAAA ACGTAATGGA AGGACCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTCTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGACAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

451 CGCGTCGGCA TTGCCCGCGC ATTGGCGATT CAGCCTGAAC TGATGCTGTT 

501 TGACGAACCG ACTTCCGCGC TCGATCCTGA ATTGGTGCAA GATGTTTTGG 

551 ATACCATGAA GGAATTGGCG CAAGAAGGCT GGACCATGGT TGTCGTTACG 

601 CATGAAATCA AGTTCGCCTT AGAAGTGGCA ACCACCGTCG TCGTGATGGA 

651 CGGCGGCGTT ATTGTCGAAC AAGGCAGCCC GCAAGATTTG TTCGACCACC 

701 CCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTACCAAG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 454; ORF 122-1>: 

ml22-l .pep 

1 MIKIRNIHKT FGENTILRGI DLDVCKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDTMKELA QEGWTMVVVT 

201 HEIKFALEVA TTVVVMDGGV IVEQGSPQDL FDHPKHERTR RFLSQIQSTK 
251 I* 

ml22-l/gl22-l 94.8% identity in 251 aa overlap 



10 20 30 40 50 60 



m!22 



l.pep 



MIKIRNIHKT FGENTILRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 
I I I I I I I I I t I I I I 1 I I I I I I I I I | | | | I I I | I | | | | | | | | | | | | | | | | | i | | | | ! | | | 
MIKIRNIHKTFGENTILRGIDLDVGKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 



gl22 



1 



WO 99/57280 



PCT/US99/09346 



352 



10 20 30 40 50 60 

70 80 90 100 110 120 

ml22-l .pep DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 
|| MCIIIIII I I I I II I I I I I I I I I I I I I I I I I M I : I I II t I I II i I I I I I I I I I 
ql22-l DNARPLRIDFSKKTSKHDILALRRKSGMVFQQYNLFPHKTVLENVMEGPVAVQGKPAAQA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml22-l-pep REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
| I I I I I I I I I I I I I I I I I I I I I I I I I E I I M 1 I I I I I I I I I I 1 I I I I I IE I I I I I I I I I I 
gl22-l REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml22-l pep DVLDTMKELAQEGWTMVVVTHEIKFALEVATTVVVMDGGVIVEQGSPQDLFDHPKHERTR 
I I I I : I I I I I : I I I I I M I I 1 I I I I : I M I I : I I I I I I I I I I I I I M - I I I I MUM 
gl22-l DVLDAMKELAREGWTMVVVTHEIKFTLEVATNVVVMDGGVIVEQGSPKELFDHLKHERTR 

190 200 210 220 230 240 



ml22-l .pep 
g!22-l 



250 

RFLSQIQSTKIX 
11111111:111 
RFLSQIQSAKIX 
250 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 455>: 



a!22-l.seq 



ATGATTAAAA 
GCGCGGCATC 
GGCCTTCCGG 
151 GAAATGCCCG 
201 AATCGATTTT 
GCAAATCAGG 
GCCTTGGAAA 
CGCCCAAGCG 
GCGACAAAGT 
CGCGTCGGCA 
TGACGAACCC 
ACGCCATGAA 
CACGAAATCA 
CGGCGGCGTT 
7 01 CCAAACACGA 
751 ATTTGA 



1 
51 
101 



251 
301 
351 
401 
451 
501 
551 
601 
651 



TCCGCAATAT 
AATTTGGATG 
CTCAGGCAAA 
AAGACGGACA 
TCTAAAAAAC 
CATGGTGTTT 
ACGTGATGGA 
CGCGAAGAGG 
GGATTTGTAT 
TTGCCCGAGC 
ACTTCCGCGC 
GGAATTGGCG 
AGTTCGCGCT 
ATCGTAGAGC 
ACGGACGCGG 



CCATAAGACC 
TGTGCAAAGG 
ACGACGTTTC 
AATCGAGTTC 
CAAGCAAACA 
CAACAATACA 
AGGACCGGTT 
CTCTGAAACT 
CCCTACCAGC 
ATTGGCGATT 
TTGACCCCGA 
CGGGAAGGTT 
GGAAGTTGCC 
AGGGCAGCCC 
AGATTTTTAA 



TTCGGCAAAA 
GCAGGTGGTC 
TGCGATGCCT 
GACAACGAGC 
CGATATTTTG 
ACCTCTTTCC 
GCCGTACAGG 
GCTGGAAAAA 
TTTCCGGCGG 
CAGCCCGAGC 
GTTGGTGCAA 
GGACGATGGT 
ACGACCGTTG 
GAAAGAGTTG 
GCCAAATCCA 



ATACCATTTT 
GTCATCCTCG 
AAACGCGTTG 
GACCGCTGAA 
GCACTGCGCC 
GCACAAAACC 
GCAAGCCTGC 
GTCGGCTTGG 
TCAGCAGCAG 
TGATGTTGTT 
GACGTGTTGA 
CGTCGTTACC 
TCGTGATGGA 
TTCGACCACC 
ATCTACCAAG 



This corresponds to the amino acid sequence <SEQ ID 456; ORF 122-1. a>: 

al22-l .pep 

1 MIKIRNIHKT FGKNTILRGI NLDVCKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLNAMKELA REGWTMWVT 

201 HEIKFALEVA TTVVVMDGGV IVEQGSPKEL FDHPKHERTR RFLSQIQSTK 

251 I* 



al22-l/ml22-l 97.2% identity in 251 aa overlap 

10 20 30 40 50 60 

al22-l .pep MIKIRNIHKTFGKNTILRGINLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 
| | | I | I I I I I j I : I I I I I I I : I I I E I I I I II I I I I I I 1 I I I I I I I t I I I I I I I M I I I I I 
ml22-l MIKIRNIHKTFGENTILRGIDLDVCKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 

10 20 30 40 50 60 



70 80 90 100 110 120 

al22-l .pep DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 
t I I I I i I I I I I I I I i I E M I I I I I I I I I I I I I I I I i I I I I I I I 1 I I I I It I i I I I I I I I I 
ml 2 2 - 1 DNERPLKI DFSKKPS KHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 

70 80 90 100 110 120 

130 140 150 160 170 180 

a 12 2 -1 . pep REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 

I 1 I I M I M I I t I I t I I I I 1 II I I I I I I I I II I I I I I I I I I I I I ! I I J I I I I I II I I 1 I I 
ml22-l REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
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130 140 150 160 170 180 

190 200 210 220 230 240 

al22-l .pep DVLNAMKELAREGWTMVVVTHEIKFALEVATTVVVMDGGVIVEQGSPKELFDHPKHERTR 
I I I : : I I I I I : t I I I I II ! I I I I I M I I I I I I I I 11 I I I I I I 1 I I t I : : I I I I I I I I I I I 
ml22-l DVLDTMKELAQEGWTMVVVTHEIKFALEVATTVVVMDGGVIVEQGSPQDLFDHPKHERTR 

190 200 210 220 230 240 

250 

al22-l.pep RFLSQIQSTKIX 
I I I I I I I I I I I I 
ml22-l RFLSQIQSTKIX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 457>: 

gl25 . seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGGT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TCGCCCCCTT GGGCTGGCAG CGCGGTCTGG CGGCCCTGCT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GTGTGCGCCT GTCGTTCGGC AAATGCGGTT 

2 51 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGTCGGCGC AacggTCAGC TCCGCTTTGG GCAAAGTGTT 

351 GTGGGACggc gaATCCTTTG TCTGGTGGGC ATTGGCAAAC GGCGCACTGA 

4 01 TCGTGCTGTG GCTGGTTTTC GGCGCACGCA GAACGGGCGG GCTGAAAACC 

4 51 GTTTCGATGC TGCTGATGCT GCTTGCCGTG TTGTGGTTGA GCGTCGAAGT 

501 GTTCGCTTCG TCCGGCACAA ACGCCGCGCC CGCCGTTTCA GACGGCATGA 

551 CCTTCGGAAC GGCAGTCGAA CTGTCCGCCG TCATGCCGCT TTCCTGGCTG 

601 CCGCTGGCCG CCGACTACAC GCGCCAAGCA CGCCGCCCGT TTGCGGCAAC 

651 CCTGACGGCA ACGCTCGCCT ATACG CTGAC GGGCTGCTGG ATGTATGCCT 

701 TGGGTTTGGC GGCGGCTCTG TTTAC CGGAG AAACCGACGT GGCGAAAATC 

751 CTGTTGGGCG CGGGCTTGGG CATAACGGGC ATTCTGGCAG TCGTCCTCTC 

801 CACCGTTACC ACAACGTTTC TCGATACCTA TTCCGCCGGC GCGAGTGCGA 

8 51 ACAACATTTC CGCGCGTTTT GCGGAAATAC CCGTCGCTGT CGGCGTTACC 

901 CTGatccgca ccgtgcttgc cgtcatgctg cccgttaccg aatataaaaa 

951 cttcctgctg cttatccgct cggtatttgg gccgatggcg ggtggttttg 

1001 attgccgaCT TTTttgtctt AAAACGGCGT GA 

This corresponds to the amino acid sequence <SEQ ID 458; ORF 125.ng>: 



The following partial DNA sequence 



m!2 5 . seq 






1 


ATGTCGGGCA 


ATGCCTCCTC 


51 


TTGGTTCGGC 


GCGGCGGTAT 


101 


TTGCGCCTTT 


GGGCTGGCAG 


151 


GCCGTCGGCG 


GCGCGCTGTT 


201 


CGGACGCAGC 


TCGATGGAAA 


251 


CAGTGCTGTT 


TTCCGTGGCG 


301 


GTGATGATTT 


ACGCCGGCGC 


351 


GTGGGACGGC 


GAATCTTTTG 


401 


TTGTGCTGTG 


GCTGGTTTTC 


451 


GTTTCGATGC 


TGCTGATGCT 


501 


CTTTTCCACG 


GCAGGCAGCA 


551 


TCGGAACGGC 


AGTCGAGCTG 


601 


CTTGCCGCCG 


ACTACACGCG 


651 


GACGGCAACG 


CTCGCCTACA 


701 


GTTTGGCAGC 


GGCGTTGTTC 


751 


CTGGGCGCAr 


GTTTGgGTGC 


801 


CGTTACCACA 


ACGTTTCTCG 



identified in N. meningitidis <SEQ ID 459>: 

TCCTTCATCT TCCTCCGCCA TCGGGCTGAT 
CGATTGCCGA AATCAGCACG GGTACGCTGC 
CGCGGTCTGG CGGCTCTACT TTTGGGTCAT 
TTTTGCGGCG GCGTATATCG GCGCACTGAC 
GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 
AATATGCTGC AACTGGCCGG CTGGACGGCG 
AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 
TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 
GGCGCACGCA AAACAGGCGG GCTGAAAACC 
GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 
CCGCCGCACA GGTTTCAGAC GGCATGAGTT 
TCCGCCGTGA TGCCGCTTTC CTGGCTGCCG 
CCACGCGCGC CGCCCGTTTG CGGCAACCCT 
CGCTGACCGG CTGCTGGATG TATGCCTTGG 
ACCGGAGAAA CCGACGTGGC AAAAATCCTG 
GGCAGGCATT TTGGCGGTCG TCCTCTCCAC 
ATGCCTATTC CGCCGGCGCG AGTGCGAACA 
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851 


ACATTTCCGC 


GCGTTTTGCG 


GAAACACCCG 


TCGCTGTCrG 


CGTTACCCTG 


901 


ATCGGCACGG 


TACTTGCCGT 


CATGCTGCCC 


GTTACCGAAT 


ATGAAAACTT 


951 


CCTGCTGCTT 


ATCGGCTCGG 


TATTTGCGCC 


GATGGCGGgC 


GGTTTTGATT 


1001 


GCCGACTTTT 


TCGTCTTGAA 


ACGGCGTGA 






This corresponds to the amino acid sequence <SEQ ID 460; ORF 


125>: 


ml 2 5 .pep 












1 


MSGNASSPSS 


SSAIGLIWFG 


AAVSIAEIST 


GTLLAPLGWQ 


RGLAALLLGH 


51 


AVGGALFFAA 


AYI GALTGRS 


SMESVRLSFG 


KRGSVLFSVA 


NMLQLAGWTA 


101 


VMIYAGATVS 


SALGKVLWDG 


ESFVWWALAN 


GALIVLWLVF 


GARKTGGLKT 


151 


VSMLLMLLAV 


LWLSAEVFST 


AGSTAAQVSD 


GMS FGTAVEL 


SAVMPLSWLP 


201 


LAADYTRHAR 


RPFAATLTAT 


LAYTLTGCWM 


YALGLAAALF 


TGETD VAKIL 


251 


LGAXLGAAGI 


LAWLSTVTT 


T F LD AY SAGA 


SANNISARFA 


ETPVAVXVTL 


301 


IGTVLAVMLP 


VTEYENFLLL 


I GS VFAPMAG 


GFDCRLFRLE 


TA* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 125 shows 92.1% identity over a 343 aa overlap with a predicted ORF (ORF 125.ng) 
from N. gonorrhoeae: 

ml25/gl25 

10 20 30 40 50 60 

MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

Ml 1 Mlil Ihl II hi I IMIIIIMill! Mi ! II II I M MMIMI M I III III 

M SGNAS SPSS S AAI GLVW FGAAVS I AE I S TGTLLAPLGWQRGLAALLLGHAVGG ALF FAA 
10 20 30 40 50 60 

70 80 90 100 110 120 

AY I GALTGRS SMES VRLS FGKRGSVLFSVANMLQLAGWTAVM I YAGATVS SALGKVLWDG 

1 1 1 1 I 1 1 I I I II 1 1 I 1 1 M 1 1 1 [ I 1 1 I I I I 1 1 I I I I ! I 1 1 1 1 1 • I I I 1 1 ] II I t I M 1 1 

AY I GALTGRS SMES VRLS FGKCGS VLFS VANMLQLAGWTAVM I YVGATVS SALGKVLWDG 
70 80 90 100 110 120 

130 140 150 160 . 170 179 

ESFVWWALANGALI VLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQ - VS 

MMIIIMIIIMIMIMM|:|MMMIIIIIIIIIMM:|||:::|::M I I 
ESFVWWALANGALIVLWLVFGARRTGGLKTVSMLLMLLAVLWLSVEVFASSGTNAAPAVS 
13 0 14 0 15 0 160 170 180 

180 190 200 210 220 230 239 

DGMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAAL 

MMM M III II III I Ml Ml MUM M M M II I I Ml MM I II I II M I I! M 

DGMTFGTAVELSAVMPLSWLPLAADYTRQARRPFAATLTATLAYTLTGCWMYALGLAAAL 
190 200 210 220 230 240 

240 250 260 270 280 290 299 

FTGE TDVAKI LLG AX LGAAG I LAWLS TVTT T FLD A YS AGAS ANN I S AR F AE T P VAVX VT 

Mllllllllllll II MM I MM M I I Ml I M Ml I IM II I MM I Nil I I 

FTGETD VAKI LLGAGLG I TG I LAWLS TVTTTFLDT Y SAGAS ANN I SARFAE I P VAVGVT 
250 260 270 280 290 300 

300 310 320 330 340 

LIGTVLAVMLPVTEYENFLLLIGSVFAPMAGGFDCRLFRLETAX 

I! MINIMI IIMIIIII If I = M 1 1 1 1 1! 1 1 I hill 

L I RTVLAVMLPVTE YKNFLLLI RS VFGPMAGGFDCRLFCLKTAX 
310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 46 1>: 

al25 . seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGAT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACACTGC 

101 TTGCGCCTTT GGGCTGGCAG CGCGGTCTGG CGGCTCTGCT TTTGGGTCAT 



ml 2 5 .pep 
9125 

ml25 -pep 
gl25 

ml 2 5 . pep 
gl25 

ml25 -pep 
gl25 

ml25 .pep 
gl25 

ml2 5 .pep 
gl25 
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151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 



GCCGTCGGCG 
CGGACGCAGC 
CAGTGCTGTT 
GTGATGATTT 
GTGGGACGGC 
TTGTGCTGTG 
GTTTCGATGC 
CTTTTCCACG 
TCGGAACGGC 
CTGGCCGCCG 
GACGGCAACG 
GTTTGGCAGC 
CTGGGCGCAG 
CGTTACCACC 
ATATTTCCGC 
GTCGGCACAC 
CCTGCTGCTT 
GCCGACTTTT 



GCGCGCTGTT 
TCGATGGAAA 
TTCCGTGGCG 
ACGCCGGCGC 
GAATCTTTTG 
GCTGGTTTTC 
TGCTGATGCT 
GCAGGCAGCA 
AGTCGAGCTG 
ACTACACGCG 
CTCGCCTACA 
GGCGTTGTTC 
GTTTGGGTGC 
ACTTTTCTCG 
CAAACTTTCG 
TGCTTGCCGT 
ATCGGCTCGG 
TCGTCTTGAA 



TTTTGCGGCG 
GCGTGCGCCT 
AATATGCTGC 
AACGGTCAGC 
TCTGGTGGGC 
GGCGCACGCA 
GTTGGCGGTT 
CCGCCGCACA 
TCCGCCGTGA 
CCACGCGCGC 
CGCTGACCGG 
ACCGGAGAAA 
GGCAGGCATT 
ATGCCTACTC 
GAAATACCCA 
CCTCCTGCCC 
TATTTGCGCC 
ACGGCGTGA 



GCGTATATCG 
GTCGTTCGGC 
AACTGGCCGG 
TCCGCTTTGG 
ATTGGCAAAC 
AAACAGGCGG 
CTGTGGCTGA 
GGTTTCAGAC 
TGCCGCTTTC 
CGCCCGTTTG 
CTGCTGGATG 
CCGACGTGGC 
TTGGCGGTCG 
CGCCGGCGTA 
TCGCCGTTGC 
GTTACCGAAT 
GATGGCG . GC 



GCGCACTGAC 
AAACGCGGTT 
CTGGACGGCG 
GCAAAGTGTT 
GGCGCGCTGA 
GCTGAAAACC 
GTGCCGAAGT 
GGCATGAGTT 
TTGGCTGCCG 
CGGCAACCCT 
TATGCCTTGG 
AAAAATCCTG 
TCCTGTCGAC 
AGTGCCAACA 
CGTCGCCGTT 
ATGAAAACTT 
GGTTTTGATT 



This corresponds to the amino acid sequence <SEQ ID 462; ORF 125. a>: 

al25 . pep 

1 MSGNASSPSS SAAIGLIWFG AAVSIAEIST GTLLAPLGWQ RGLAALLLGH 

51 AVGGALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA 

101 VMIYAGATVS SALGKVLWDG ES FV WW ALAN GALIVLWLV F GARKTGGLKT 

151 VSMLLMLLAV LWLSAEV FST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP 

201 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGETD VAKIL 

251 LGAGLGAAGI LAVVL STVTT TFLDAYSAGV SANNISAKLS E IPIAVAVAV 

301 VGTLLAVLLP VTEYEN FLLL IGSVFAPMAX GFD CRLFRLE TA* 



ml25/al25 95.6% identity in 342 aa overlap 

10 20 30 40 50 60 

ml25 . pep MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 
I | | I I I M I I I : I I I I II I I I I I II M I I I I I I I II I 1 I I I I I I I I I I I I I I I I I I I I If 
al25 MSGNASSPSSSAAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml25 . pep AYIGALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSALGKVLWDG 
I I | I I I II M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
al25 AYIGALTGRS SMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSALGKVLWDG 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml25 . pep ESFVWWALANGAL I VLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFST AGSTAAQVSD 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M 1 I I I I I II I I I ! I 
al25 E SFVWWALANGAL I VLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFST AGSTAAQVSD 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml25 . pep GM SFGT AVE LSAVMPLSWLPLAADYTRHARRPFAATLT AT LAYTLTGCWM YALGLAAALF 

(I M I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I 
al25 GM S FG T AVE L S A VM P L S W L P LAA D YT R HARR P F AAT LT AT LAY T LT G C WM Y ALG L AAAL F 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml25 . pep TGETDVAKILLGAXLGAAGILAVVLSTVTTTFLDAYSAGASANNISARFAETPVAVXVTL 
I t 1 I II I I I I M I I I I I M I I I I I I I I I I I M M I I I I : I I I II 1 I ::: I I : I I I : : 
al2 5 TGETDVAKILLGAGLGAAGILAWLSTVTTTFLDAYSAGVSANNISAKLSEIPIAVAVAV 

250 260 270 280 290 300 



310 320 330 340 

ml 2 5 . pep IGTVLAVMLPVTEYENFLLLIGSVFAPMAGGFDCRLFRLETAX 
: I I : I I I : I I I I I I I I II I I I II I I I I I I I II I i II H I II I 
al25 VGTLLAVLLPVTEYENFLLLIGSVFAPMAXGFDCRLFRLETAX 
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310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 463>: 

gl26 . seq 

1 AtgccgtcTG AAaccCcaaa ggcACGCCGC CGG CTTTCAG ACGGCATCGC 

51 GTCCGACAAC CATACCAAAG AAT C CATC AT GCTCACCctg tacggcGAAA 

101 CTTTCCCTTC GCGGCTGCTg ctcggcacgG cggcctacCC G AC CCCTGAA 

151 ATC CTCAAAC AATCCGTCCG AACCGCCCGG CCCGCGATGA ttaccGTCTC 

201 GCTGCGCCGC ACGGGATGCG GCGGCGAGGC GCACGGTCAG GGGTTTTGGT 

2 51 CGCTGCTTCA AGAAACCGGC GTTCCCGTCC TGCCGAACAC GGCAGGCTGC 

3 01 CAAAGCGTGC AGGAAGCGGT. AACGACGGCG CAAATGGCGC GCGAAGTGTT 
351 TGAAACCGAT TGGATAAAAT TGGAACTCAT CGGCGACGAC GACACCTTGC 

4 01 AGCCGGACGT GTTCCAACTC GTCGAAGCGG CGGAAATCCT GATTAAAGAC 
4 51 GGCTTCAAAG TGCTGCCTTA TTGCACCGAA GACCTGATTG CCTGCCGCCG 
501 CCTGCTCGAT GCGGGCTGTC AGGCGTTGAT GCCGTGGGCG GCTCCCATCG 
551 GCACGGGTTT GGGGGCGGTT CACGCCTATG CGCTCAAAAT CCTGCGCGAA 
601 CGCCTGCCCG ACACGCCGCT GATTATCGAC GCGGGCTTGG GTTTGCCTTC 
651 CCAAGCGGCA CAAGTGATGG AATGGGGTTT TGACGGCGTA TTGTTAAACA 
701 CCGCCGTTTC CCGCAGCGGC GACCCCGTCA ACATGGCGCG CGCCTTCGCA 
751 CTCGCCGTCG AATCCGGACG GCTGGCATTT GAAGCCGGGC CGGTCGAAGC 
801 G CGAACC AAA GCCCAAGCCA GCACGCCGAC AGT CGGACAA CCGTTTTGGC 
851 ATTCGGCGGA ATATTGA 



This corresponds to the amino acid sequence <SEQ ID 464; ORF 126.ng>: 

gl26 .pep 

1 MPSETPKARR RLSDGIASDN HTKESIMLTL YGETFPSRLL LGTAAYPTPE 

51 ILKQSVRTAR PAMITVSLRR TGCGGEAHGQ GFWSLLQETG VPVLPNTAGC 

101 QSVQEAVTTA QMAREVFETD WIKLELIGDD DTLQPDVFQL VEAAEILIKD 

151 GFKVLPYCTE DLIACRRLLD AGCQALMPWA APIGTGLGAV HAYALKI LRE 

201 RLPDTPLIID AGLGLPSQAA QVMEWGFDGV LLNTAVSRSG DPVNMARAFA 

251 LAVE S GRLAF EAGPVEARTK AQASTPTVGQ PFWHSAEY* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 465>: 

ml26 . seq (partial ) 

1 . . CACTATACAA AGGAACCCAT TATGCTCACC CTATACGGCG AAACTTTCCC 

51 CTCGCGGCTG CTGCTCGGCA CGGCTG CCTA CCCGACCCCC GAAATCCTCA 

101 AACAATCCAT CCAAACCGCC CAGCCTGCGA TGATTACCGT CTCGCTGCGC 

151 CGCGCGGGAA GCGGCGGCGA GGCGCACGGT CAGGGGTTTT GGTCGCTGCT 

201 TCAAGAAACC GGCGTTCCCG TCCTGCCGAA CACGGCAGGC TGCCAAAGCG 

251 TGCAGGAAGC GGTAACGACG GCGCAAATGG CGCGCGAAGT GTTTGAAACC 

3 01 GATTGGATAA AATTGGAACT CAT CGGAGAT GACGACACCT TGCAGCCGGA 
351 TGTGTTCCAG CTTGTCGAAG CGGCGGAAAT CCTGATTAAA GACGGCTTCA 

4 01 AAGTGCTGCC TTATTG CAC C GAAGACCTGA TTGCCTGCCG CCGCCTGCTC 
451 GACGCGGGCT GTCAGGCGTT GATGCCGTGG GCGGCCCCGA TCGGCACGGG 
501 TTTGGGCGCG GTTCACGCCT ACGCGTTGAA CGTCCTGCGC GAACGCCTGC 
551 CCGACACGCC GCTGATTATC GACGCGGGCT TGGGTTTGCC CTCACAGGCG 
6 01 G CAC AAGTGA TGGAATGGGG CTTTGACGGC GTGCTTTTGA ATACTGCCGT 
651 TTCCCGCAGC GGCGATCCGG TCAATATGGC ACGCGCCTTC GCACTCGCCG 
701 TCGAATCCGG ACGGCTGGCA TTTGAAGCCG GACCGGTCGA AGCACGCGAC 
751 AAAGCGCAAG CCAGCACGCC GAC AGT CGG A CAACCGTTTT GG C ATT CGG C 
8 01 GGAATATTGA 



This corresponds to the amino acid sequence <SEQ ID 466; ORF 126>: 

ml26.pep (partial) 

1 . .HYTKEPIMLT LYGETFPSRL LLGTAAYPTP EILKQSIQTA QPAMITVSLR 
51 RAGSGGEAHG QGFWSLLQET GVPVLPNTAG CQSVQEAVTT AQMAREVFET 
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101 DWIKLELIGD DDTLQPDVFQ 

151 DAGCQALMPW AAPIGTGLGA 

201 AQVMEWGFDG VLLNTAVSRS 

251 KAQASTPTVG QPFWHSAEY* 



LVEAAEILIK DGFKVLPYCT EDLIACRRLL 
VHAYALNVLR ERLPDTPLII DAGLGLPSQA 
GDPVNMARAF ALAVESGRLA FEAGPVEARD 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 126 shows 95.9% identity over a 269 aa overlap with a predicted ORF (ORF 126.ng) 
from TV. gonorrhoeae: 

ml26/g!26 

10 20 30 40 

HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQ 

::||| I I I Ml M II ( II MM I II Mi | III II hM i: 
MPSETPKARRRLSDGIASDNHTKESIMLTLYGETFPSRLLLGTAAYPTPEILKQSVRTAR 
10 20 30 40 50 60 



ml26 -pep 
gl26 



50 60 70 80 90 100 

ml 2 6 . pep PAMI TVSLRRAGSGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 

Illllllllhl I I I M II I I I M M I M I I I II I M M| I I I I I II I Ml II IMM M 
gl2 6 PAMI TVS LRRTGCGGEAHGQGFWSIjLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 

70 80 90 100 110 120 



110 120 130 140 150 160 

m!26 . pep WI KLELI GDDDTLQPDVFQLVEAAEI LI KDGFKVIjPYCTEDLI ACRRLLDAGCQALMPWA 

IIIIIMMIIMIIIMIIIMIIIIIIIIIIMMIIMIIIMMMIIMIIIMI 

g 1 2 6 WI KLELI GDDDTLQPDVFQLVEAAE I LI KDGFKVLPYCTEDL I ACRRLLDAGCQALMPWA, 

130 140 150 160 170 180 



170 180 190 200 210 220 

ml26 .pep APIGTGLGAVHAYALNVLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 

I I Ml M I I I M 1 1 I-: MM I II Ml MM Ml MMIM I 1 1 II I I I! MMM I II 

g 1 2 6 AP I GTGLGAVHAYALKI LRERL PDT PL 1 1 DAGLGLPS QAAQVMEWGFDGVLLNTAVSRSG 

190 200 210 220 230 240 



230 240 250 260 270 

ml2 6 . pep D P VNMARAFALAVE S GRLAF E AGP VE ARD KAQ AS T P TVGQ P FWHS AE YX 

I I II I II I M I I MM M Ml I I M Ml I M 1 1 1 I 1 1 M I 1 1 M 1 1 1 I 

gl26 DPVNMARAFALAVESGRLAFEAGPVEARTKAQASTPTVGQPFWHSAEYX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 467>: 

al26 . seq 

1 TTGTTAATCC ACTATACAAA GGAACCCATT ATGCTCACCC TGTACAGCGA 

51 AACTTTCCCT TCGCGGCTGC TGCTCGGCAC AGCCGCCTAC CCGACCCCTG 

101 AAATCCTCAA ACAATCCGTC CGAACCGCCC GGCCCGCGAT GATTACCGTC 

151 TCGCTGCGCC GCGCGGGATG CGGCGGCGAG GCGCACGGTC AGGGGTTTTG 

201 GTCGCTGCTT CAAGAAACCG GCGTTCCCGT CCTGCCGAAC ACGGC AGGCT 

251 GCCAAAGCGT GCAGGAAGCG GTAACGACGG CGCAAATGGC GCGCGAAGTG 

301 TTTGAAACCG AT TGGAT T AA ACTCGAACTC ATCGGCGACG ACGACACCTT 

351 GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC GGCGGAAATC CTGATTAAAG 

4 01 ACGGCTTCAA AGTGCTGCCT TATTGCACCG AAGACCTGAT TGCCTGCCGC 

4 51 CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG ATGCCGTGGG CGGCCCCGAT 

501 CGGCACGGGT TTGGGCGCGG TTCACGCCTA CGCGTTGAAC GTCCTGCGCG 

551 AACGCCTGCC CGACACGCCG CTGATTATCG ACGCGGGCTT GGGTTTGCCC 

601 TCACAGGCGG CACAAGTGAT GGAATGGGGC TTTGACGGCG TGCTTTTGAA 

651 TACTGCCGTT TCCCGCAGCG GCGATCCGGT CAATATGGCA CGCGCCTTCG 

701 CACTCGCCGT CGAATCCGGA CGGCTGGCAT TTGAAGCCGG ACCGGTCGAA 

751 GCACGCGACA AAGCGCAAGC CAGCACGCCG ACAGTCGGAC AACCGTTTTG 

801 GCATTCGGCG GAATATTGA 
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This corresponds to the amino acid sequence <SEQ ID 468; ORF 126.a>: 

al2 6.pep 

1 LLIHYTKEPI MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV 

51 SLRRAGCGGE AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VT T AQMARE V 

101 FETDWIKLEL IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR 

151 RLLDAGCQAL MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP 

201 SQAAQVMEWG FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAG PVE 

251 ARDKAQASTP TVGQPFWHS A EY * 

ml26/al26 98.1% identity in 269 aa overlap 

10 20 30 40 50 

HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGE 

| | | | j | M I I I I : I II I I I I I I I I I M I I II I I I It : : I I : I M I I I I M I II III 
LLIHYTKEPIMLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRAGCGGE 
10 20 30 40 50 60 

60 70 80 90 100 110 

AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPD 
| | | M I | I I I I I I I I I I I I I I I I I M I I I I M I I I I M I I I M I I I I II I I I I I I I I I I I 
AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPD 
70 80 90 100 110 120 

120 130 140 150 160 170 

VFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN 
M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I It I I I I I I I I I I I I I I I I I M I I I I I I 
VFQLVEAAEI LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN 
130 140 150 160 170 180 

180 190 200 210 220 230 

VLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESG 

I M | | | | | I I II I I II M I I I I I I I M I I ! I I I I I I I I I M I I I I I II I I I I I I I I I I I I 
VLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESG 
190 200 210 220 230 240 

240 250 260 270 

RLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 

I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 
RLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 
250 260 270 



ml 2 6 .pep 
al26 

ml 2 6 . pep 

al26 

ml26.pep 
al26 

ml26 .pep 
al26 

ml26.pep 
al26 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 469>: 

gl26-l . seq 

1 ATGCTCACCC TGTACGGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 GGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCACGGGATG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAC GTGTTCCAAC TCGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ATGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCTCCCAT CGGCACGGGT TTGGGGGCGG TTCACGCCTA 

501 TGCGCTCAAA ATCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCT TCCCAAGCGG CACAAGTGAT GGAATGGGGT 

601 TTTGACGGCG TATTGTTAAA CACCGCCGTT TCCCGCAGCG GCGACCCCGT 

651 CAACATGGCG CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG GCCGGTCGAA GCGCGAACCA AAGCCCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 470; ORF 126-l.ng>: 

gl26-l .pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRTGCGGE 
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51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALK ILRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARTKAQASTP 

251 TVGQPFWHSA EY* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47 1>: 

ml 2 6-1 . aeq 

1 ATGCTCACCC TATACGGCGA AACTTTCCCC TCGCGGCTGC TGCTCGGCAC 

51 GGCTGCCTAC CCGACCCCCG AAATCCTCAA ACAATCCATC CAAACCGCCC 

101 AGCCTGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGAAG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 AT C G GAG AT G ACGACACCTT GCAGCCGGAT GTGTTCCAGC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

7 51 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 5; ORF 126-1>: 

ml 2 6-1 .pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSI QTAQPAMITV SLRRAGSGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

251 TVGQPFWHSA EY* 

ml26-l/gl26-l 96.9% identity in 262 aa overlap 

10 20 30 40 50 60 

MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGEAHGQGFWSLL 

| j | | |[ I I ! 1 I I I I I I I I i 11 I I I I I I I I 1 I : I I I I I I E I I 1 • I I ! t I I 1 I ( I i I i I 
MLTLYGETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRTGCGGEAHGQGFWSLL 
10 20 30 40 50 60 



ml26-l .pep 
g!26-l 



70 80 90 100 110 120 

ml 2 6-1 pep QETGV P VLPNTAGCQSVQEAVTTAQMAREVFETDW I KLEL IGDDDTLQPD VFQLVEAAEI 
| | | If | | I M I I I I I I I I M It I I I I It t I i I I I I 1 I I I I t I I I I I I I I I I t I I ! I M I I 
Q12 6-1 QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml26-l pep L I KDGFKVLPYCTEDLI AC RRLLDAGCQALMPWAAP I GTGLGAVHAYALN VLRERLPDTP 

I I | I I I I I I I I I I I I I I I I 11 I I I E I 1 I 1 M I 1 I I 1 I I I H I I I M I I I I 1 I I I i I I I 
ql26-l LI KDGFKVLPYCTEDLI AC RRLLDAGCQALMPWAAP I GTGLGAVHAYALKILRERLPDTP 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml26-l pep LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
| | | | I | I | | | | | 1 I I I I i I I I I I I I I I It I I I 1 I I I I I I I I I I I I I I ! I I I I I I I I I I I I 
ql26-l LI I DAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

190 200 210 220 230 240 

250 260 
ml 2 6-1 .pep ARDKAQASTPTVGQPFWHSAEYX 

II I I I 1 I I I t ! I I i I I I I I I I I 
gl26-l ARTKAQASTPTVGQPFWHSAEYX 

250 260 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 473>: 

al26-l . seq 

1 ATGCTCACCC TGTACAGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 
51 AGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 
101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGATG CGGCGGCGAG 



WO 99/57280 



PCT/US99/09346 



360 



151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATTAA ACTCGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

4 51 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

7 51 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 474; ORF 126-1 .a>: 

al26-l .pep 

1 MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRAGCGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

251 TVGQPFWHSA EY* 

al26-l/ml26-l 98.1% identity in 262 aa overlap 

10 20 30 40 50 60 

al2 6-l.pep MLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRAGCGGEAHGQGFWSLL 
I I I I I : I II II I I I I I 1 I I I I I ! I t I I I I : : II : I I I I I I I I I I I I I I I I II I I I I I I I 
ml26-l MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGEAHGQGFWSLL 

10 20 30 40 50 60 

70 80 90 100 110 120 

a!2 6-l .pep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
I I I I I I II I I I I I I I I I I I i I I I I M I i I I I I I I I I I I I I I I II I I I I I I I I IE I I I I I I 
ml2 6- 1 QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 

70 80 90 100 110 120 

130 140 150 160 170 180 

al26-l.pep LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 
I I II I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I 
ml2 6-l LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 

130 140 150 160 170 180 

190 200 210 220 230 240 

al2 6-1 . pep LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
i I I I I I I I I M 1 I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I II I I I I I I II I I I I M I I 
ml2 6-l L 1 1 D AGLGL P S Q AAQVME WG F DG VLLNT A V S RS G D P VNMARA FA L A VE S G RL A FE AG P V E 

190 200 210 220 230 240 

250 260 
a 12 6-1 . pep ARDKAQAST PT VGQP FWH S AE YX 
I I I I I I II I I I I i I I I I I I I I I I 
ml2 6-l ARDKAQASTPTVGQPFWHSAEYX 

250 260 



The following partial DNA sequence 



gl27 . seq 






1 


ATGGAAATAT 


GGAATATGTT 


51 


CGCGGAGGCG 


GCCGAATCCG 


101 


GCGCCCTTCT 


GTTGAATATC 


151 


GAAAGCAAGC 


GGCGGTTTTT 


201 


GGTGCTGTTT 


TCGCTGGCAT 


251 


CTTTGTCGAT 


GTTTGCGGTG 


301 


CTGATTATGT 


GTCTGTCGGG 


351 


CTCGGTCGGC 


GACTATATCG 


401 


ACATCAATCT 


GTTGAACACG 


451 


GTCGGACAGC 


TTGCGGGAAC 


501 


GAGCCACCCC 


GTGCGCCGCG 



identified in N. gonorrhoeae <SEQ ID 475>: 

GAACACTTGG CCCGATGCCG TCCCGATACG 
TGGCGGCGGT CGCGGCTTTG CTGCTGGCGC 
CACT TCAGAC GGCATCCGGA TTTCGGCATC 
GGTTGCCAGC CGCAATATAA CGCTGCTTTT 
TTATCTGGTC GGCGCAAATT CAAACG CTGG 
GCGGCGGCGG TCGTCGTGGC GACAAAAGAA 
CAGTATTTTA aggtctGCCA C CCAGC AATA 
AAATCAACGG CCTGCGCGGG CGCGTGGTCG 
CTGATGATGC AGGTCGGTCC GAACCCCTTG 
CACCGTTTCT TTCCCCAACA GCCTGTTGTT 
ACAATATTTT GGGCGACTAT GTCAT C CAT A 
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551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

6 01 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 TCAGCGGTAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

701 CCGCCAGGCC GCGCGTTACC CGCGTACCGT ACGACGACAA GGCATACCGC 

751 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAAT CCAACA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAAT CAT C 

851 CCGCCGgctc cgAAACACTT TAA 



This corresponds to the amino acid sequence <SEQ ID 476; ORF 127.ng>: 

gl27 .pep 



1 MEIWNMLNTW PDAVPIRAEA AES VAAVAAL LLARALLLNI HFRRHPDFGI 

51 ESKRRFLVAS RNITLLLVLF SLAFIWSA QI Q TLALSMFAV AAAVWAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRY LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 477>: 



1 ATGGAAATAT GGAATATGTT GGACACTTGG CTCGGTGCCG TCCCGATACG 

51 TGCGGAGGCG GTCGAATCCG TGGCGGCGGT TGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT T CG CTGG CAT TTATCTGGTC GGCGCAAATC CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACGAAGGAA 

3 01 CTGAT TATGT GTCTGTCGGG CAGTATTTTA AGGTCTGCCA CCCAGCAATA 

3 51 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

401 AC AT CAACCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 

451 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 

501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA 

551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 CCAACGG S AT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

701 CCGCCAGACC GCGCGTTACC CGCGTGCCGT ACGATGACAA GGCATACCGC 

751 ATCATCGTCC GCTTCGCTTC CCCCGTTTCA AAGCGGCTGG AAAT CCAACA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCACC 

851 CCGCCGGCTC CGAAACACTT TAA 



This corresponds to the amino acid sequence <SEQ ID 478; ORF 127>: 

ml27 .pep 

1 ME I WNMLDTW LGAVPIRAEA VES VAAVAAL LLARALLLNI HFKRHPDFGI 
51 ESKRRFLVAS R NITLLLVLF SLAFIWSA QI QTLALSMFAV AAAVWATKE 



101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRX LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 127 shows 97.9% identity over a 290 aa overlap with a predicted ORF (ORF 127.ng) 
from N. gonorrhoeae: 

ml27/g!27 



ml27 . seq 



ml27 .pep 



10 20 30 40 50 60 

ME IWNMLDTWLGAVP I RAEAVES VAAVAALLLARALLLNI HFKRHPDFG I ES KRRFLVAS 



g!27 




10 20 30 40 50 60 



ml27 -pep 



70 80 90 100 110 120 

RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAWVATKELIMCLSGSILRSATQQYSVG 



gl27 
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70 80 90 100 110 120 

130 140 150 160 170 180 

ml27 . pep DYIEINGLRGRVVDINLLNTLMMQVGPNPL.VGQLAGTTVSFPNSLLLSHPVRRDNILGDY 

I Ml M I I I M I II I I MIMI I I i I Ml IMMIIIIIII ! I I I II II MM II III I I 

gl2 7 DYIEINGLRGRWDINLLNTLMMQVGPNPLVGQIiAGTTVSFPNSLLLSHPVRRDNILGDY 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml2 7 .pep VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRXLENVQAEKLFITPAARPRVT 

M M I M M M I I M M I I I MM Ml M III II IMM I M I I M I I 1 1 1 1 I I M I I I 

gl2 7 VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRYLENVQAEKLFITPAARPRVT 

190 200 210 220 230 240 

250 260 270 280 290 

ml27 . pep RVP YDDKAYRI I VRFAS PVS KRLE I QQAVMDE FLRVQYRLLNHPAGSETLX 

I I M I II M M M M M M I I I M I M MM MMMM ! M I I II I I I II 

gl 2 7 RVP YDDKAYRI I VRFAS PVS KRLE I QQAVMDE FLRVQYRLLNHPAGSETLX 

250 260 270 280 290 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 479>: 

al27 . seq 

1 ATGGAAATAT GGAATATGTT GGACACTTGG CTCGGTGCCG TCCCGATACG 

51 TGCGGAGGCG GTCGAATCCG TGGCGGTGGT CGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATC CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACGAAGGAA 

301 CTGATTATGT GTCTGTCGGG CAGCATTTTA AGGTCTGCCA CCCAGCAATA 

351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

4 01 ACATCAACCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 

4 51 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 

501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAC GT CAT CC AT A 

551 CGGTCGAAAT CCCGGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 CCAACGGCAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

7 01 CCGCCAAACC GCGCGTTACC CGCGTGCCGT ACGATGACAA GGCATACCGC 

7 51 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 
801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATTACC 

8 51 CCGCCGGCTC CGAAACACTT TAA 

This corresponds to the amino acid sequence <SEQ ID 480; ORF 127.a>: 

al27 .pep 

1 MEIWNMLDTW LGAVPIRAEA VES VAVVAAL LLARALLLNI HFKRHPDFGI 

51 ESKRRFLVAS RN ITLLLVLF SLAFIWSA QI Q TLALSMFAV AAA VW AT K E 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

2 01 RLKAVLEPLC APYIPAIQRH LENVQAEKLF ITPAAKPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNYPAGSETL * 

ml27/al27 98.6% identity in 290 aa overlap 

10 20 30 40 50 60 

ml27 . pep MEIWNMLDTWLGAVPIRAEAVESVAAVAALLLARALLLNIHFKRHPDFGIESKRRFLVAS 
I I I I I I I I I I II I I I I I I I I I i I I I : I 1 I I I I I I I 11 I I I I I I I I I I I I I M I I I I I I I I 
al27 MEIWNMLDTWLGAVPIRAEAVESVAVVAALLLARALLLNIHFKRHPDFGIESKRRFLVAS 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 2 7 .pep RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVVVATKELIMCLSGSILRSATQQYSVG 
I I I I I I I I I I I I I I I I I I I I II I I I I I t I I I I I II I 1 I I I I I t I I I I I I I i I I I I I I | I | 
al27 RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVVVATKELIMCLSGSILRSATQQYSVG 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml27 . pep DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 
M I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I II I I II II I I I I I I I I II I I I M I II 
al27 DYIEINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 

130 140 150 160 170 180 

190 200 210 220 230 24 0 

ml27 .pep VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRXLENVQAEKLFITPAARPRVT 
I I I I I I I I I I I I II M I I I I M I I I I II I I I I I I I I I I I II 1 I M II I M I I I I : I I I I 
al27 VIHTVEIPVPIHLDSDEAVCRLPCAVLEPLCAPYIPAIQRHLENVQAEKLFITPAAKPRVT 

190 200 210 220 230 240 

250 260 270 280 290 

ral27 .pep RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDE FLRVQYRLLNHPAGSETLX 

I I I I I I I I II I I I M 1 I I I I I I I I! I I I I I I I I I I I I t I I I I : I J I I II I I 
al27 RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDE FLRVQYRLLNYPAGSETLX 

250 260 270 280 290 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 48 1>: 

gl28 . seq 

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC AT CGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

3 01 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCAT CA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGccaaa cTGCTCGGCT T T AAAAATT A CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 

1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAAT GGAG 

16 51 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 

1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 

1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatctTC 

1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 

1851 cAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 

1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg CtCCCGCAGC 

1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 



WO 99/57280 



PCT/US99/09346 



364 



2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 482; ORF 128.ng>: 

gl28 .pep 

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA IAEARGQIAA V KAQTHTGW A 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

3 01 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

3 51 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

401 I GGVYMDIi YA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

451 GKEARLSHDE I LTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE . TGEPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANS FGHIF 

601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ EIIiAVGGSRS 

651 AAESFKAFRG REPSIDALLR QSGFDNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 483>: 



ml28 . seq 


(partial) 


1 


ATGACTGACA 


51 


AATCAAAACC 


101 


CGCGCGAACA 


151 


AACACTGTCG 


201 


GGGCGTGGTG 


251 


CCGTCTATAA 


301 


GGACAAGACA 


351 


CGAATTCGAC 


1 


TACGCCAGCG 


51 


wGTCAAAAAA 


101 


AAmTCAAAAA 


151 


TGG CACAAAG 


201 


AGGCGGCGTT 


251 


CGTGGATGAA 


301 


CAAyTGCCCA 


351 


CAGGGAAGCC 


401 


CCGGACACGG 


451 


TCCGGCATCA 


501 


TATGGAAAAT 


551 


ACGAAGAAAC 


601 


GCCGCCAAAA 


651 


CGCCCTCTTT 


701 


AAAACTGGCA 


751 


CAGCCGCCCG 


801 


AGGCGG CT AT 


851 


GCGCGGACGC 


901 


GGCAAACGCT 


951 


nGCAGAATCC 


1001 


TCTTGCGCCA 



This corresponds to the amino acid sequence <SEQ ID 484; ORF 128>: 

ml28 .pep (partial) 

1 MTDNALLHLG EEPRFDQIKT ED I KPALQTA IAEAREQIAA I KAQTHTGW A 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH 



// 



1 YASEKLREAK YAFS ETXVKK YFPVGXVLNG LFAQXKKLYG I GFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRRFSDGTL 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXE FALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI 

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 
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301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

ml28/gl28 

10 20 30 40 50 60 

gl28 pep MIDNALLHLGEEPRFNQIQTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 

I II I I M MMI I I Ml M I I M! Ml II M Ml 1 1 1 1 = I I I I I I I I I I I f I Mill 

m i 2 8 MTDNALLHLGEE PRFDQ I KTED I KPALQTAI AEAREQI AAI KAQTHTGWANTVEPLTG I T 

10 20 30 40 50 60 

70 80 90 100 110 120 

gl28 . pep ERVGRI WGVVSHLNS VVDTPELRAVYNELMPE I TVFFTE I GQD I ELYNRFKT I KNS PEFA 

I I I I 1 1 I I II I II I IMIMMM MMI II IMMIM M M.M I I I M II IIMM 

ml 2 8 ERVGRI WGWSHLNCVADTPELRAVYNELMPE I TVFFTE I GQDI ELYNRFKT I KNS PEFD 

70 80 90 100 110 120 



130 140 150 160 170 , 180 

gl2 8 . pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

lllllllllhl 
ml 2 8 TLSPAQKTKLNH 

130 

// 

340 350 360 

gl28 pep Y AGE KLREAKYAF SET E VKKY F P VG KVLAG 

1 1 = 11 Mill 1 1 II I I MM MM M I 

ml 2 8 YAS E KLREAKYAF S ETX VKK Y F P VGXVLNG 

10 20 30 



370 380 390 400 410 420 

gl2 8 . pep LFAQIKKLYGIGFAEKTVPVWHKDVRYFELQQNGKTIGGVYT4DLYAREGKRGGAWMNDYK 

MM I II II 1 1 h M M 1 1 1 1 1 1 1 M 1 1 M I h M M I II II I I M M M I M 1 1 1 M 

ml2 8 LFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWMNDYK 

40 50 60 70 80 90 



430 440 450 460 470 480 

gl2 8 . pep GRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVDELGV 

1 1 1 1 I • 1 1 1 1 1 1 1 1 M 1 1 M 1 1 I 1 1 I h M M II I 1 1 I 1 1 ! 1 1 1 I ! 1 1 i 1 1 1 1 1 1 1 ! 1 1 

ml 2 8 GRRRFSDGTLQLPTAYLVCNFAP PVGGREARLSHDE I LI LFHETGHGLHHLLTQVDELGV 

100 110 120 130 140 150 



490 500 510 520 530 540 

gl2 8 . pep SGI NGVEWDAVE L P S Q FME N FVWE YNVLAQMS AHE E TGE P L P KE LFD KMLAAKN FQRGMF 

MMM I 1 1 11 1 1 1 1 1 1 1 i 1 1 II I 1 1 1 1 1 I M II M llllll II lllllll Ml 

ml2 8 SGINGAOCWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXIiAAKNFQXGMF 

160 170 180 190 200 210 



550 560 570 580 590 600 

gl28.pep LVRQMEFALFDMMIYSESDECRLKNWQQVLDSVRKEVAVIQPPEYNRFANSFGHIFAGGY 

Ml I II I II I Ml I hi I I M 1 I ! t i I 1 1 I 1 1 = 1 1 1 1 I I t 1 1 I 1 1 I MIIIIMM 

ml28 XVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIFAGGY 

220 230 240 250 260 270 



610 620 630 640 650 660 

gl28 . pep S AG Y YS YAW AE VLSTD A YAAFEESDDVAATGKRFWQE I LAVGGSRSAAES FKAFRGREPS 

I M II I M I M 1 1 : 1 1 M M M M M I II M I M 1 1 M II I II I = M M I II I 1 1 1 II 

ml 2 8 SAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRGREPS 

280 290 300 310 320 330 
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670 679 
gl28 .pep IDALLRQSGFDNAAX 

Illllhlllllh 
ml2 8 I DALLRHSGFDNAVX 

340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 485>: 

al28 . seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACG C AC AC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAAC CAT C A AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

7 51 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 
801 AACCGCCAAA CTGCTCGGCT T CAAAAACT A CGCCGAATTG TCGCTGGCAA 

8 51 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 C CAAAT C AAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AAC C G G AC AC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GAT T T AC AG C GAAGACGACG AAGGCCGTCT 

17 01 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

17 51 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGC AAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 486; ORF 128.a>: 

al28 .pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28/al28 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml2 8.pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I M I M I I M I II II II I I I I I I I 
al2 8 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml28 .pep ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

I I I I I I I I I I I I I I i : I I I I I I! : I M I I I I I I I I I I I I I II I I M I I I I t I II I I I I I 
al28 ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 

ml 2 8 . pep TLS PAQKTKLNH 

II I I I I I I I I i 

al2 8 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 



ml28.pep 

al28 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 

190 200 210 220 230 240 



ml 2 8. pep 

al28 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 

140 150 

ml28 . pep YASEKLREAKYAFSETXVKKYFPVGX 

I f : I M I M I I I I I I I I I I I I I I I 
al28 ARRAK P Y AE K D L AE VKA FARE S LG L A D LQ P W D LG Y AG E K LRE AK Y A F S E T E VKK Y F P VG K 

310 320 330 340 350 360 

160 170 180 190 200 210 

ml28 . pep VLNGLFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 
I I i I I I I I I I I I I I I 11 I I I I I I I I I I I I I I II I I I I : I I ! I I I I I I I I I I I I M I I I 
al28 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

220 230 240 250 260 270 

m!28 . pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
I M I I I I I I I I II I I I I I I I I I II I : I I I I I : I I I I I I I I I I I I I I I I M I I II I I I i I 
al28 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

280 290 300 310 320 330 

ml28 . pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
I I I I I I I I I I I M I I M I II I I I I I I II I I I I I I I I I I I M I I I I I I I 1 I I I I I I I 
al28 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 

340 350 360 370 380 390 

m!28.pep XGMFXVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 
M J III I I I I I 1 I I I I I I I I M I I I N I I I I I I I I I : I I I :: I I I I I | I | MINI 
a 128 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAVVRPPEYNRFANSFGHIF 

550 560 570 580 590 600 
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400 410 420 430 440 450 

AGGYSAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRG 
! | E I I I : I M I I I I II I I I I I I I I I t I I I I I I I I ! I M I M I I I I 111:111111111 
AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
610 620 630 640 650 660 

460 470 
REPSIDALLRHSGFDNAVX 

I I I I I I I I I I I I I I I I I = 
REPSIDALLRHSGFDNAAX 
670 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 487>: 

gl28-l.seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AG CGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

2 51 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 
301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 
351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GAT CACGACC 
401 TGCGCGATTT CGTATTGAGC GGCG CGGAAC TGCCGCCCGA ACGGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

5 01 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 
651 GCACTACCTT GCCGTTATCC AATACGC CGG CAACCGCGAA CTGCGCGAAC 
7 01 AAATCTACCG CGC CTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 
801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACG CCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGG CGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

1451 TGTC CGGCAT CAACGGCGTA AAA 

This corresponds to the amino acid sequence <SEQ ID 488; ORF 128-1. ng>: 

gl2 8 - 1 .pep (partial ) 

1 MIDNALLHLG EEPRFNQIKT ED I KPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTG I T ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRD FVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAG I PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFIjHDL 

3 01 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

3 51 EVKKYFPVGK VLAGIiFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 
4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 489>: 

ml28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC AT CAAAG CCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 



ml28 .pep 
al28 

ml28 .pep 
al28 
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2 01 GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCG 
251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

3 01 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTCGAC ACCCTCTCCC CCG C ACAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CC AAATT CTC 
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

5 51 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
601 G CCG CGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC 

6 51 ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC 
701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAACTTTC AGACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGCAA ACGCCCTGCA 
8 01 AACCG CCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACGCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAAAG CCTG A ACCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCAG CGAAAAACTG CG CGAAGCC A AATACGCGTT C AG CGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACT CTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGG CACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

12 01 ATAGGCGGCG TTTATATGGA TTTGTACG C A CGCGAAGGCA AACGCGGCGG 

12 51 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 

1351 GGCAGGGAAG CCCGCCTGAG CCACGACGAA ATCCTCATCC TCTTCCACGA 

14 01 AACCGGACAC GGGCTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC 

1551 CCACGAAGAA ACCGG CGTTC CCCTGCCGAA AGAACTCTTC G AC AAAATG C 

1601 TCGCCGCCAA AAACTTCCAA CGCGGCATGT TCCTCGTCCG GCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT G ATTT AC AG C GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAG ACAG CGT GCGCAAAAAA GTCGCCGTCA 

1751 TCCAGCCGCC CGAATACAAC CGCTTCGCCT TGAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAG CGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG AT CG CGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC 

2 001 ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA 



This corresponds to the amino acid sequence <SEQ ID 490; ORF 128-1>: 

ml28-l .pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLANALQTAK LLGFKNYAEL SIATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 

451 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDD EGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 

6 01 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG RE PS I DAL LR HSGFDNAV* 



ml28-l/gl28-l 94.5% identity in 491 aa overlap 

10 20 30 40 SO 60 

gl28 - 1 pep MIDNALLHLGEEPRFNQIKTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 

I MMM III I I I I M Mill MMM I I Mill M IMMI MM I Ml II Mill 

ml 2 8 - 1 MTDNALLHLGEEPRFDQ I KTED I KPALQTAI AEAREQ I AAI KAQTHTGWANTVE PLTG I T 

10 20 30 40 50 60 



70 80 90 100 110 120 

gl28 - 1 . pep ERVGRIWGWSHLNSVVDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

1 1 J M 1 1 1 M 1 1 j 1 1 1 ^ I 1 1 1 1 II F 11 1 1 1 1 M i 1 1 1 1 1 i II 1 1 i M 1 1 1 1 1! E I F 1 1 1 

ml 2 8 -1 ERVGRIWGWSHI^SVADTPELRAvYNEIiMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 
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gl28-l -pep 



ml28-l 



130 140 150 160 170 180 

TLSPAQKTKLDHDLRDFVLSGAEIiPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

I M MMIMM IMI III Ml I MIMMMMIM I Mil I MIMMMIMMM I 

TLSPAQKTKLNHDLRDFVLiSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 



190 200 210 220 230 240 

gl2 8-1 .pep FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYIAVIQYAGNRELREQIYRAYV 

II M I Mill I i I I I Ml M M I I MMMI Mill I M MM Ml I 1 1 1 1 1 1 1 1 1 1 I I 

ml28-l FDD AA P LAG I PE D ALAM FAAAAQ S E S KTG YK I GLQI PH YLA V I Q YADNRE LREQ I YRA YV 

190 200 210 220 230 240 

250 260 270 280 290 300 

gl2 8 - 1 . pep TRASELSNDGKFDNTANIDRTLENALKTAKLIiGFKNYAELtSLiATKMADTPEQVLNFLHDL 

i | | | | I I : I I I I I I I I I I I I I I I I h I II I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 2 8 - 1 TRASEIiSDDGKFDNTANIDRTIjANAIjQTAKLLGFK>rYAEIiSLiATKMADTPEQVLNFIjHDL 
250 260 270 280 290 300 

310 320 330 340 350 360 

gl2 8 - 1 . pep ARRAKPYAEKDLAEVKAFAREHLGLADPQPWDLSYAGEKIjREAKYAFSETEVKKYFPVGK 

Ml II! IMM III Mill M hill MM I M MM I II I M I II II III IM M M 

ml28-l ARRAKPYAEKDLAEVKAFARESLNI^LQPWDLGYASEKL^EAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 



gl2 8-l .pep 



ml28-l 



370 380 390 400 410 420 

VLAGLFAQ I KKLYG IGFAEKTVPVWHKDVRYFELQQNGKTI GGVYMDL.YAREGKRGGAWM 

II Ml III M MIMMMMI I IMIMIMIMM M III III I IIMMIM1 M 

VLNGLFAQI KKLYG IGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 



gl2 8-l -pep 



m!28-l 



430 440 450 460 470 480 

NDYKGRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGIiHHLLTQVD 

II I IMMMI I MM IM Ml I MM IMMMMMIMI I 1 1 1 1 1 1 1 1 I 1 1 1 1 II I 

NDYKGRRRFSDGTLQLPTAYLVCNFAP P VGGREARLSHDEI LI LFHETGHGLHHLLTQVD 
430 440 450 460 470 480 



490 

gl2 8 - 1 . pep ELGVSGINGVK 

MMIIIMM 

ml2 8-1 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 49 1>: 

al28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGC CGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAG CCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

451 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGG CT AC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAG CCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACG C A CGCGAAGGCA AACGCGGCGG 
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1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 



CGCGTGGATG 
TGCAACTGCC 
GGCAAAGAAG 
AACCGGACAC 
TATCCGGCAT 
TTTATGGAAA 
CCACGAAGAA 
TCGCCGCCAA 
TTCGCCCTCT 
GAAAAACTGG 
TCCGACCGCC 
GCAGGCGGCT 
GAG CGCGG AC 
CAGGCAAACG 
GCGGCAGAAT 
ACTCTTGCGC 



AACGACTACA 
CACCGCCTAC 
CCCGCTTGAG 
GGCCTGCACC 
CAACGGCGTA 
ATTTCGTTTG 
ACCGG CGTTC 
AAACTTCCAA 
TTGATATGAT 
CAACAGGTTT 
CGAATACAAC 
ATTCCGCAGG 
GCATACGCCG 
CTTTTGGCAG 
CCTTCAAAGC 
CACAGCGGCT 



AAGGCCGCCG 
CTCGTCTGCA 
CCATGACGAA 
ACCTGCTTAC 
GAATGGGACG 
GGAATACAAT 
CCCTGCCGAA 
CGCGGAATGT 
GATTTACAGC 
TAGACAGCGT 
CGCTTCGCCA 
CTATTACAGC 
CCTTTGAAGA 
G AAATCCT CG 
CTTCCGCGGA 
TCGACAACGC 



CCGTTTTTCA 
ACTTCACCCC 
ATCCTCACCC 
CCAAGTCGAC 
CAGTCGAACT 
GTCTTGGCGC 
AGAACTCTTC 
TCCTCGTCCG 
GAAGACGACG 
GCGCAAAGAA 
ACAGCTTCGG 
TACGCGTGGG 
AAGCGACGAT 
CCGTCGGCGG 
CGCGAACCGA 
GGCTTGA 



GACGGCACGC 
GCCCGTCGGC 
TCTTCCACGA 
GAACTGGGCG 
GCCCAGTCAG 
AAATGTCCGC 
GACAAAATGC 
CCAAATGGAG 
AAGGCCGTCT 
GTCGCCGTCG 
CCACATCTTC 
CGGAAGTATT 
GTCGCCGCCA 
ATCGCGCAGC 
GCATAGACGC 



This corresponds to 

al28-l .pep 

1 MTDNALLHLG 

51 NTVEPLTGIT 

101 GQDIELYNRF 

151 ELAKLQTEGA 

2 01 AAQSEGKTGY 

2 51 KFDNTANIDR 
301 ARRAKPYAEK 

3 51 EVKKYFPVGK 

4 01 IGGVYMDLYA 
451 GKEARLSHDE 
501 FMENFWEYN 
551 FALFDMMIYS 
6 01 AGGYSAGYYS 
651 AAESFKAFRG 



the amino acid sequence <SEQ ID 492; ORF 128-l.a>: 



EEPRFDQIKT 
ERVGRIWGW 
KTIKNSPEFD 
QLSAKFSQNV 
KIGLQIPHYL 
TLENALQTAK 
DLAEVKAFAR 
VLNGLFAQIK 
REG KRGGAWM 
ILTliFHETGH 
VLAQMSAHEE 
EDDEGRLKNW 
YAWAEVLSAD 
RE P S I DALLR 



EDIKPALQTA 
SHLNSVTDTP 
TLSHAQKTKL 
LDATDAFG I Y 
AVI QYADNRK 
LLGFKNYAEL 
ESLGLADLQP 
KLYGIGFTEK 
NDYKGRRRFS 
GLHHLLTQVD 
TGVPLPKELF 
QQVLDSVRKE 
AYAAFEESDD 
HSGFDNAA* 



IAEAREQIAA 
ELRAAYNELM 
NHDLRDFVLS 
FDDAAPLAG I 
LREQIYRAYV 
SLATKMADTP 
WDLGYAGEKXi 
TVPVWHKDVR 
DGTLQLPTAY 
ELGVSGINGV 
DKMLAAKNFQ 
VAWRPPEYN 
VAATGKRFWQ 



I KAQTHTGWA 
PEITVFFTEI 
GAELPPEQQA 
PEDALAMFAA 
TRASELSDDG 
EQVLNFLHDL 
REAKYAFSET 
YFELQQNGET 
LVCNFTPPVG 
EWDAVELPSQ 
RGMFLVRQME 
RFANSFGHIF 
EILAVGGSRS 



ml28-l/al28-l 97.8% identity in 677 aa overlap 

10 20 30 40 50 60 

al2 8 - 1 . pep MTDNALLHLGEE PRFDQ I KTEDI KPALQTAI AEAREQI AAI KAQTHTGWANTVE PLTG I T 

I M i II MM M M M I lllllll IM IMMMIMM III I! Ill II I M M IM M I 

ml2 8 - 1 MTDNALLHLGEE PRFDQ IKTEDI KPALQTAI AEAREQI AAI KAQTHTGWANTVE PLTG I T 

10 20 30 40 50 60 

70 80 90 100 110 120 

al28-l .pep ERVGRIWGWSHIiNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

Ml I 1 1 M Ml I I M M II MM MM MM I M II i II IM 1 1 M M II I i Ml II 1 1 I 

ml 2 8 - 1 ERVGRIWGWSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 

130 140 150 160 170 180 

al 2 8 - 1 . pep TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKIiQTEGAQLSAKFSQNVLDATDAFGI Y 

III II I I II II MM llllllill Ml IMM Mill I IM M II II I I I M I I M II I 

m!2 8 - 1 TLS PAQKTKLNHD LRD FVLSGAEL P P EQQAE LAKLQT EG AQLS AKF S QNVLDATDAFG I Y 

130 140 150 160 170 180 

190 200 210 220 230 240 

al2 8 - 1 . pep FDD AAP LAG I PEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQI YRAYV 

Ml I II: MIMIII MM MM MMI MMM MIMMIMMM IMMM IM I M 

ml 2 8 - 1 FDD AAP LAG I PEDALAMFAAAAQSESKTGYKIGLQI PHYLAVIQYADNRELREQI YRAYV 

190 . 200 210 220 230 240 

250 260 270 280 290 300 

al2 8 - 1 . pep TRASELSDDGKFDNTANIDRTLENAIjQTAICLLGFKNYAELSliATKMADTPEQVLNFIiHDL 

I I I III I II I M MM I MMI MIMMIMMM M III IMMM II II I Ml III 

ml 2 8 - 1 TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 



310 320 330 340 350 360 

al2 8 - 1 - pep ARRAKPYAEKDIiAEVKAFARESIiGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 

I I i III I III II MM II I III IM MMM MMMM Ml I MM I I II II I III Ml 
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ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

370 380 390 400 410 420 

VLNGLFAQ I KKLYG IG FTEKT VP VWHKD VRYFELQQNGET I GG VYMDL YAREGKRGGAWM 

I! IMIMIIII Mill I IIMI Mill III Ml II Mill IMIM I! Mill Mill I 

VLNGLFAQ I KKLYG IGFTE KTV PVWHKDVRYFE LQQNGET I GG VYMD LYAREG KRGGAWM 
370 380 390 400 410 420 

430 440 450 460 470 480 

NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

I i I M Mill M I II II I MM MMI I Ml M IMMIM I I 1 1 1 1 1 1 1 1 1 1 1 M I i I 

NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
430 440 450 460 470 480 

490 500 510 520 530 540 

ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

i e 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ELGVSGINGVEWDAVELPSQFMENFWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

550 560 570 580 590 600 

RGMFLVRQMEFALFDMMI YSEDDEGRLKNWQQVLDS VRKEVAWRPPE YNRFANSFGHI F 

I M MM Ml II Mill I I II MM I II Ml II Mill I M I!-! I IMIM 1 1 1 i I { 

RGMFLVRQME FALFDMMI YSEDDEGRLKNWQQVLDS VRKKVAVI QP PE YNRFALSFGH I F 
550 560 570 580 590 600 

610 620 630 640 650 660 

AGG YS AG YYS YAWAEVLS ADA YAAFE E SDDVAATGKR FWQE I LAVGGS RS AAES FKAFRG 

1 1 1 1 1 1 1 f 1 1 I i 1 1 r 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 I f 1 1 1 1 1 1 E E I E 

AGG YS AG YYS YAWAEVLS ADA YAAFEESDDVAATGKRFWQE I LA VGGSRSAAES FKAFRG 
610 620 630 640 650 660 

670 679 
REPSIDALLRHSGFDNAAX 

I IIMIIMI M IMIM 
RE PS IDALLRHSGFDNAVX 
670 

al28-l/ P44573 

sp | P44 573 | OPDA_HAEIN OLIGOPEPTIDASE A >gi | 1075082 | pir | | C64 055 oligopeptidase A (prlC) homolog 
- Haemophilus influenzae (strain Rd KW20) 

>gi| 1573174 (U32706) oligopeptidase A (prlC) [Haemophilus influenzae Rd] Length = 681 
Score = 591 bits (1507) , Expect = e-168 

Identities = 309/677 (45%), Positives = 415/677 (60%), Gaps =: 4/677 (0%) 



Query: 


4 


NALLH LG E E PRFD Q I KTED I KPALQTXXXXXXXXXXXXXXXTHTGWANTVE PLTG I TERV 


63 






N LL++ P F QIK E I+PA++ H W N + PLT +R+ 




Sb j ct : 


5 


NPLLNIQGLPPFSQIKPEHIRPAVEKLIQDCRNTIEQVLKQPHFTWENFILPLTETNDRL 


64 


Query: 


64 


GRI WGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDI ELYNRFKTI KNSPEFDTLS 


123 






R W VSHLNSV ++ ELR AY +P ++ + T +GQ LYN + +KNS EF S 




Sbjct : 


65 


NRAWSPVSHLNSVKNSTELREAYQTCLPLLSEYSTWVGQHKGLYNAYLALKNSAEFADYS 


124 


Query: 


124 


HAQKTKLWHDLRDF^LSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 


183 






AQK + + LRDF LSG L E+Q -f+ ++L+++FS NVLDAT + + + 




Sbjct: 


125 


IAQKKAI ENSLRDFELSGIGLSEEKQQRYGE IVARLSELNSQFSNNVLDATMGWEKLI EN 


184 


Query : 


184 


AAPLAG I PEDALAMFAAAAQSEGKTG YKIGLQI PHYLAVI Q YADNRKLREQI YRAYVTRA 


243 






A LAG+PE AL +A+S+G GY+ L+IP YL V+ Y +NR LRE++YRAY TRA 




Sbjct: 


185 


EAELAGLPESALQAAQQSAESKGLKGYRFTLEIPSYLPVMTYCENRALREEMYRAYATRA 


244 


Query : 


244 


SELSDD-GKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDLAR 


302 






SE + GK+DN+ ++ L ++ AKLLGF Y ELSLATKMA+ P+QVL+FL LA 




Sbjct : 


245 


SEQGPNAGKWDNSKVMEEILTLRVEtiAKLIiGF^TYTELSLATKMAENPQQVLDFLDHI^ 


304 


Query : 


303 


RAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKiREAKYAFSETEVKKYFPVGKVL 


362 






RAKP EK+L E+K + + G+ +L PWD+G+ EK YA ++ E + + YFP +V+ 




Sb j Ct : 


305 


RAKPQGEKELQELKGYCEKEFGVTELAPWDIGFYSEKQKQHLYAINDEELRPYFPENRVI 


364 



ml28-l 

a!28-l .pep 
ml28-l 

a!28-l .pep 
ml28-l 

al28-l .pep 
ml28-l 

a!28-l .pep 
ml28-l 

al2 8 -1 .pep 
ml28-l 

al28-l .pep 
ml2 8-l 
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Query: 363 NGLFAQIKKLYGIGFTE-KTVPVWHKDVRYFEL-QQNGETIGGVYMDLYAREGKRGGAWM 420 

+GLF IK+++ I E K V WHKDVR+F+L +N + G Y+DLYARE KRGGAWM 
Sbjct: 365 SGLFELIKRIFNIRAVERKGVDTWHKDVRFFDLIDENDQLRGSFYLDLYAREHKRGGAWM 424 

Query: 4 21 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEIXXXXXXXXXXXXXXXXQVD 4 80 

+D GR+R DG+++ P AYL CNF P+G K A +H+E+ Q+D 
Sbjct: 4 25 DDCIGRKRKLDGSIETPVAYLTCNFNAPIGNKPALFTHNEVTTLFHEFGHGIHHMLTQID 4 84 

Query: 4 81 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 540 

V+GINGV WDAVELPSQFMEN+ WE LA +S H ETG PLPKE ++L AKNFQ 
Sbjct: 4 85 VSDVAGIKGVPWDAVELPSQFMENWCWEEEALiAFISGHYETGE PLPKE KLTQLLKAKNFQ 544 

Query: 541 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 600 

MF++RQ+EF +FD ++ D + L SV+ +VAV++ ++ R +SF HIF 

Sbjct: 545 AAMFILRQLEFGIFDFRLHHTFDAEKTNQILDTLKSVKSQVAVIKGVDWARAPHSFSHIF 604 

Query: 601 XXXXXXXXXXXXWAEVLSADAYAAFEESDDV-AATGKRFWQEILAVGGSRSAAESFKAFR 659 

WAEVLSADAY+ FEE TGK F EIL GGS E FK FR 

Sbjct: 6 05 AGGYAAGYYSYLWAEVLSADAYSRFEEEGIFNPITGKSFLDEILTRGGSEEPMELFKRFR 664 

Query: 6 60 GREPS IDALLRHSGFDN 676 

GREP +DALLRH G N 
Sbjct: 665 GREPQLDALLRHKGIMN 681 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 493>: 

gl29.seq 

1 ATGCTTTCAC CTCCTCGGCG TAAAACGGCG GCACATCAAT CAAGCCGTCT 

51 TTCATTTGCG TGCGGAAAAA ATGCGGCGTG TTGCCGTGAT CAAAATCAAT 

101 ATCGTGCAGC ATCCAGCCCA AATCGCGGTT TGCCTCGCTT TCCGATAACG 

151 CCGACGGCGG CAGCGGTTCA CCCTTATCCG CGCTTTCGCC ATTTGCCCTT 

201 TCAGGCTGCG GGCATAGGGG CGGAACAGGC GGCGGTCGAA TCCTGTTTCA 

251 TCCGGACAAA CGCGTTGGCA GTCGGAAAAT CCGGCCGGCC GTGTCAAATA 

3 01 ATGCGTTACT TTGGCCGGGT CTTGTCCTTT GTAAGCGGCG GTCTTTTTTT 

3 51 GCGCGCCATC CGCATCTGTT TGGGCGCATG GCAAACGGCG GCTGCCGTAC 

4 01 AATCAAAATG TTTGGCGATT TCATGCAGAC AGGCATCCGG ATGCCGCCCG 
451 ACATATCGAG CCGGTTTTTG CCTATCCGAT TTGGCGGCAT TTAGGC CGGT 
501 AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 494; ORF 129.ng>: 

g!29.pep 

1 MLSPPRRKTA AHQSSRLSFA CGKNAACCRD QNQYRAASSP NRGLPRFPIT 

51 PTAAAVHPYP RFRHLPFQAA GIGAEQAAVE SCFIRTNALA VGKSGRPCQI 

101 MRYFG RVLSF VSGGLFLRAI RIC LGAWQTA AAVQSKCLAI SCRQASGCRP 

151 TYRAGFCLSD LAAFRPVT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 495>: 

ml29.seq (partial) 

1 . . TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 
51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 
101 GAAAATTCGG CCGGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 

2 51 G C AG AT AGG C ATCCGGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

3 01 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 496; ORF 129>: 

ml29.pep (partial) 

1 . .YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGRLC QIMRYFGRVL 
51 FFVSGGLFLR VIPIC LSAXQ MVAAVQSKCL AISCRXASGC CPTYXAGFCL 
101 SDLTAFRPVT * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 129 shows 79.1% identity over a 1 10 aa overlap with a predicted ORF (ORF 129.ng) 
from TV. gonorrhoeae: 

ml29/gl29 

10 20 30 

ml2 9 pep YLRFHYLPFQAAGIGTEQVAVKSCFIQINT 

I IhMlllllllhlhlhlllh h 
gl2 9 RDQNQYRAASSPNRGLPRFPITPTAAAVHPYPRFRHLPFQAAGIGAEQAAVESCFIRTNA 
30 40 50 60 70 80 

40 50 60 70 80 90 

ml29 pep LWGKFGRLCQIMRYFGRVLFFVSGGLFLRVIPICLSAXQMVAAVQSKCLAISCRXASGC 

hill II 1 1 II Mil 111 I II II I II hi 1 1 hi I =11 1 M M M Mil MM 

gl2 9 LAVGKSGRPCQIMRYFGRVLSFVSGGLFLRAIRICLGAWQTAAAVQSKCLAISCRQASGC 
90 100 110 120 130 140 

100 no 
ml 2 9 . pep CPTYXAGFCLSDLTAFRPVTX 

III IIIMIIhlllllll 

gl2 9 RPTYRAGFCLSDLAAFRPVTX 
150 160 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 497>: 

al29.seq (partial) 

1 TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCAGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 

251 GCAGATAGGC ATCCTGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 498; ORF 129.a>: 

al29.pep (partial) 

1 YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGQLC QIMRYFGRVL 

51 FFVSGGLFLR VIFIC LSA*Q MVAAVQSKCL AISCR*ASWC CPTY*AGFCL 

101 SDLTAFRPVT * 

ml29/al29 98.2% identity in 1 10 aa overlap 

10 20 30 40 50 60 

ml29.pep YLRFHYLPFQAAGIGTEQVAVKSCFIQINTLVVGKFGRLCQIMRYFGRVLFFVSGGLFLR 
I | | j | I | [ I I I I I I I I M I t I I M I II M II I M I I I : I I I I I I I I I I I I I I M I ! I f I I 
a 12 9 YLRFHYLPFQAAGIGTEQVAVKSCFIQINTLVVGKFGQLCQIMRYFGRVL FFVSGGLFLR 

10 20 30 40 50 60 

70 80 90 100 110 

ml 29. pep VIPI CLS AXQMVAAVQSKCLAI SCRXASGCC PTYXAG FCLS DLTAFRPVTX 

M I I I I I I I I M I I I I I I I I I I I I M I i I I I I I I I I M I M I I I I I I II I 
al29 VIPICLSAXQMVAAVQSKCLAISCRXASWCCPTYXAGFCLSDLTAFRPVTX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 499>: 

gl30 . seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTG AG CGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG TGGATCGTTC GGCGATGTCG ATGCCACTAC GGAAGCGGCA 

151 ACGCAGACCC GCATCCAGCC TGTCGGACAA TTGACGATGG GTGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 
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251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

3 01 AACGGCGACT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

3 51 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGCAG 

4 01 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACCTACAT GGCGAATAAA 

4 51 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 
501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 

5 51 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 
601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 
651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGT AT CAAAA 
701 AAGGCAAAGA AACCTTGCAC AAACATGCCC TTGAAGGCTT TAACGCGATG 
751 CCGGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 
8 01 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 500; ORF 130.ng>: 

gl30.pep 

1 MKQLRDNKAQ GSALF TLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGI PVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAADLTDQEL KRAI T YMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 50 1>: 

ml30 . seq (partial) 

1 . . GGCGAACAGA TTTTCGGCAA AATCTGTATC CAATGCCACG CGGCGGACAG 

51 CAATGTGCCG AACGCTCCGA AACTGGAACA CAACGGCGAT TrGGCACCGC 

101 GTATCGg CAA GGCTTCGATA CCTTGTTCCA ACACGCGCTG AACGGCTTTA 

151 ACGCCATGCC TGCAAAAGGC GGTGCGGCAG ACCTGACCGA TCAGGAACTT 

2 01 AAACGGGCGA TTACTTACAT GGCGAACAAA AGCGGCGGTT CTTTCCCGAA 

2 51 TCCTGATGAG GCTGCGCCTG CCGACAATGC CGCTTCAGGA ACAGCTTCTG 

3 01 CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG CGAAGGCAGA AGACAAGGGT 
351 GCGGCAcCCC TGCGGTCGGC GTTGACGGTA AAAAAGT CTT CGAAGCAACC 

4 01 TGTCAGGTGT GCCACGGCGG TTCGATTCCC GGTATT CCCG GCATAGGCAA 
451 AAAAGACGAT TGGGCACCGC GTATCAAAAA AGGCAAAGAA ACCTTGCACA 
501 AACACGCCCT TGAAGGCTTT AACGCGATGC CTGCCAAArG CGg CAATG C A 
551 GGTTTGAGCG ATGACGAAgT CAAAGCGGCT GTTGACTATA TGGCAAACCA 
601 ATCCGGTGCA AAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 502; ORF 130>: 

m!30 .pep (partial) 

1 . . GEQIFGKICI QCHAADSNVP NAPKLEHNGD XAPRIQGFDT LFQHALNGFN 
51 AMPAKGGAAD LTDQELKRAI TYMANKS GGS FPNPDEAAPA DNAAS GTAS A 
101 PADSAAPAEA KAEDKGAAPA VGVDGKKV FE ATCQVCHGGS IPGIPGIGKK 
151 DDWAPRIKKG KETLHKHALE GFNAMPAKXG NAGLSDDEVK AAVDYMANQS 

201 GAKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 130 shows 98.1% identity over a 206 aa overlap with a predicted ORF (ORF 130.ng) 
from N. gonorrhoeae: 

ml30/gl30 

10 20 30 

ml30 pep GEQ I FGKI C I QCHAADSNVPNAPKLEHNGD 

M MINI Ml I Ml I M MM Ml I II M 

gl30 DATTEAATQTRIQPVGQLTMGDGIPVGERQGEQIFGKICIQCHAADSNVPNAPKLEHNGD 
50 60 70 80 90 100 

40 50 60 70 80 89 

ml30 pep XAPRI -QGFDTIiFQHALNGFNAMPAKGGAADLTDQELKRAI TYMANKS GGS FPNPDEAAP 

IMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 M i I i 1 1 1 1 f 1 1 1 1 i 1 1 1 E 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

g l3 0 WAPRIAQGFDTLFQHALNGFNAMPAKGGAADLTDQELKRAITYMANKSGGSFPNPDEAAP 
110 120 130 140 150 160 
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90 100 110 120 130 140 

ml 3 0 . pep ADNAASGTASAPADSAAPAEAKAEDKGAA- PAVGVDGKKVFEATCQVCHGGS I PGI PGIG 

Illllllllllllllllllllllllllll M III I M III II I I ill IM MM III M 

gl3 0 ADNAASGTASAPADSAAPAEAKAEDKGAAAPAVGVDGKKVFEATCQVCHGGS I PGI PGIG 

170 180 190 200 210 220 

150 160 170 180 190 200 

ml 3 0 . pep KKDDWAPRI KKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 

Ml I MMM Mill ill I Ml III III II I M Ml MM I I I Ml I I II Ml I I I 

g 1 3 0 KKDDWAPRI KKGKETLHKHALEGFNAMPAKGGNAGLSDDEVKAAVDYMANQSGAKFX 

230 240 250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 503>: 

al30.seq 

1 ATGAAACAAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG CGGCTCGTTC GGCGATGTCG ATGCCACTAC GGAAGCAGCA 

151 ACGCAGACCC GTATCCAGCC TGTCGGACAA TTGACGATGG GCGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 

251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

301 AACGGCGATT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

351 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGTAG 

4 01 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACTTACAT GGCGAACAAA 

4 51 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 

501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 

551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 

601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 

651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 

7 01 AAGGCAAAGA AACCTTGCAC AAACACGCCC TTGAAGGCTT TAACGCGATG 

7 51 CCTGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 

801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 504; ORF 130. a>: 

al30 . pep 

1 MKQLRDNKAQ GSAL FTLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAVDLTDQEL KRAI T YMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 

m 1 30/al 30 97.6% identity in 206 aa overlap 

10 20 30 

ml30 .pep GEQIFGKICIQCHAADSNVPNAPKLEHNGD 

I I I I I I I M I I I M I I I II I I II I I I I I I I 
al30 DATTEAATQTRIQPVGQLTMGDGIPVGERQGEQIFGKICIQCHAADSNVPNAPKLEHNGD 
50 60 70 80 90 100 

40 50 60 70 80 89 

ml30 . pep XAPRI-QGFDTLFQHALNGFNAMPAKGGAADLTDQELKRAITYMANKSGGSFPNPDEAAP 
I I I I I I I I I I I I M I I 1 I I I I I I I I I I : II I I M M M ! I I I I I I I I I I I I I I II I II 
al30 WAPRIAQGFDTLFQHALNGFNAMPAKGGAVDLTDQELKRAITYMANKSGGSFPNPDEAAP 
110 120 130 140 150 160 

90 100 110 120 130 140 

ml 3 0 . pep ADNAASGTASAPADSAAPAEAKAEDKGAA- PAVGVDGKKVFEATCQVCHGGS I PG I PG IG 

I II I I M It I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I It M I I I I I I I I 
a 1 3 0 ADNAASGTAS APADSAAPAEAKAE DKGAAAPAVGVDGKKVFEATCQVCHGGS I PG I PG IG 

170 180 190 200 210 220 



ml30 . pep 



150 160 170 180 190 200 

KKDDWAPRIKKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 
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I M I I I II I I II I I I I I I E 1! I I I M I I I I I I I II I I M I I I II I I I I I If I I I I I 
a 130 KKDDWAPRIKKGKETLHKHALEGFNAMPAKGGNAGLSDDEVKAAVDYMANQSGAKFX 
230 240 250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 505>: 

gl32 . seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT ATTAATATTA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACCTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CCAACTTcct CAgccGCTCG AccGccGTTG CAGCAACAtt 

201 tttcttTGca acctgcAtgg gctatggTgt atattcacac CCACACGACA 

251 AAACACGGTT TGGACTtcag caacataCGA CAGACTCAGC AagcACCCAA 

3 01 ACCcgtAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACagtTTTT CAAATgccga caTGgtga 

This corresponds to the amino acid sequence <SEQ ID 506; ORF 132.ng>: 

gl32 .pep 

1 MEAFKTLIWI INIISALAV I VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QHTTDSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 507>: 

ml32.seq (partial) 

1 ATGGAACCCT TCAAAACCTT AATTTGGATT GTTAATTTAA TTTCCGCTTT 

51 GGCCGTCTTC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGA. . . 

This corresponds to the amino acid sequence <SEQ ID 508; ORF 132>: 

ml 3 2 -pep (partial ) 

1 MEPFKTLIWI VNLI SALAV F VLVLLQHGKG ADAGATFG . . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 132 shows 89.5% identity over a 38 aa overlap with a predicted ORF (ORF 132.ng) 
from N. gonorrhoeae: 

ml32/gl32 



10 20 30 

ml32 . pep ME P FKTL I W I VNL I S ALAVFVLVLLQHG KGADAGATFG 

II M MM IM M 1 M MM I! Ml MMM IMMI 

gl32 MEAFKTLIWI INI I S ALAVI VLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS 

10 20 30 40 . 50 60 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 509>: 

al32 . seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT GTTAATATAA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CTAACTTCCT CAGCCGCTCG ACCGCCGTTG CAGCAACATT 

201 TTTCTTTGCA ACCTGCATGg GCTATGGTGT ATATTCACAC CCACACGACA 

2 51 AAACACGGTT TGGACTTCAG CAACGTACAA CAAACTCAGC AAGCACCCAA 

301 ACCCGTAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACAGTTTTT CAAATGCCGA CATGGTGA 

This corresponds to the amino acid sequence <SEQ ID 510; ORF 132.a>: 

a!32 .pep 

1 MEAFKTLIWI VNIISALA VI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QRTTNSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 



ml32/al32 



92.1% identity in 38 aa overlap 
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10 20 30 

ml32 . pep MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG 
II I I I I I i M I : I M I M : M I I I 1 I II I I I I! I I I I 
al32 MEAFKTLIWIVNIISALAVIVLVLLQHGKGADAGAT FGSGSGSAQGVFGSAGNAN FLSRS 

10 20 30 40 50 60 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 51 1>: 

gl34 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCCGATGCGG GTAAAACCAC GCTGACCGAA AAACTGCTGC 

101 TGTTTTCGGG CGCGATTCAA AGCGCAGGCA CGGTGAAAGG TAAGAAAACC 

151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC AGCGCGGCAT 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTACAAAGAC CACACCGTCA 

2 51 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 
301 GTTTTAACCG CAGTGGACAG CGCCTTGATG GTCATCGACG CGGCAAAAGG 

3 51 CGTGGAAGCG CAAACCATCA AACT CTTGAA CGTCTGCCGC C TGCGCG ATA 
401 CGCCGATTGT TACCTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCT 
451 TTGGAACTCT TGGACGAAGT GGAAGACATC CTGCAAATCC GCTGCGCGCC 
5 01 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 
551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCCTGCCG 
601 CACGAGTTCG ACATCATCAA AGGCATAAAC AATC C CGAAT TGGAACAACG 
651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 
701 CGGCTTCCAA CGAATTTAAT CTCGacgaAT TTCTCGccgG CGAACTCACG 
751 CCAGTGTTCT TCGGCTCTGC GATTAACAAC TTCGGCATTC AGGAAATCCT 
801 CAATTCATTG ATTGACTGGG CACCCGCACC GAAACCGCGC GACGCGACCA 
851 TGCGCATGGT CGGGCCGGAC GAG C CGAAAT TTTCCGGATT T AT CTTT AAA 
901 ATCCAAGCCA ATATGGACCC GAAACAC CGC GACCGTATCG CCTTCTTGCG 
951 CGTCTGCTCC GGTAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTAG TAACCTTCAT GTCGCACGAC 

1051 CGCGAACTGG CGGAAGAAGC CTACGCCGGC GACATCATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTTACCGG CATCCCATTC TTCGCGCCCG AACTGTTCCG CAGCGTCCGC 

12 01 AT CAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGTT TGCAACAACT 

12 51 CGG CGAAGAA GGTGCGGTTC AAGTATTCAA ACCGATGAGC GGCGCGGATT 
1301 TGATTTTGGG TGCGGTCGGC GTGTTGCAGT TTGAAGTCGT AACCTCACGC 

13 51 CTCGCCAACG AATACGGCGT GGAAGCCGTG TTCGACAGCG CATCCATCTG 
1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCGGAATTTG 

14 51 AAAAAGCCAA CGCAGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 
1501 TACCTCGCCC CCAACCGCGT GAATTTGGGG TTGACGCAAG AACGCTGGCC 
1551 GGACATCGTG TTCCACGAAA CGCGCGAACA TTCGGTCAAA CTCTAA 

This corresponds to the amino acid sequence <SEQ ID 512; ORF 134.ng>: 

gl34 .pep 

1 MSQEILDOVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNL.LDTPG HQDFSEDTYR 

101 VLTAVDSALM V I DAAKGVE A QTI KLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LEliLDE VED I LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERIiP 

201 HEFDIIKGIN NPELEQRFPL EIQQLRDEIE LVQAASNE FN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATMRMVGPD EPKFSGFIFK 

301 I QANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELAEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGL.QQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 
451 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AE F E KAN AGN LAI DAGGNLA 
501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 513>: 

ml34 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAACTCTTGC 

101 TGTTTTCGGG CGCGATTCAG AGCGCGGGTA CGGTAAAAGG CAAGAAAACC 

151 GGCAAATTCG CCACTTCCGA CTGGATGGAA ATCGAGAAGC AGCGCGGCAT 
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201 TTCCGTGGCA TCAAGTGTGA TGCAGTTCGA TTACAAAGAC CACACCGTCA 

251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

3 01 GTTTTAACCG CCGTGGACAG CGCATTAATG GT CAT CGACG CGGCAAAAGG 

3 51 CGTGGAAGCG CAAACCATCA AGCTCTTAAA CGTCTGCCGC CTGCGCGATA 

4 01 CACCGATTGT TACGTTTATG AACAAATACG ACCGCGAAGT GCGCGATTCC 

4 51 CTGGAACTTT TGGACGAAGT GGAAAACATT TTAAAAATCC GCTGCGCGCC 

5 01 CGTTACCTGG CCGAT CGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 
5 51 TCCTGAACGA TGAAATTTAT CTCTTTGAAG CTGGCGGCGA ACGCCTGCCG 
601 C AC GAGTTCG ACATCATCAA AGGCATCGAT AATCCTGAAT TGGAACAACG 
651 CTTTCCGTTG GAAAT CCAG C AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 
701 CGGCTTCCAA CGAGTTTAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 
751 CCCGTATTCT TCGGCTCTGC GAT T AACAAC TTCGGTATTC AGGAAATCCT 
801 CAATTCATTG ATTGACTGGG CGCCCGCGCC GAAACCGCGC GACGCGACCG 
8 51 TACGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT TATCTTCAAA 
901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 
951 CGTCTGCTCC GGCAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

10 01 TCAACCGCGA AATCGCCGCC TCCAGCGTGG TTACC TT CAT GTCGCACGAC 

1051 CGCGAGCTGG TTGAAGAAGC CTACGCCGGC GACATTATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTCACCGG CATCCCATTC TTCGCACCCG AACTGTTCCG CAGCGTACGC 

12 01 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGCT TGCAACAGCT 
1251 CGGCGAAGAA GGCGCGGTGC AGGTGTTCAA ACCGATGAGC GGCGCGGATT 

13 01 TGATTTTGGG CGCGGTCGGC GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 

13 51 CTCGCCAACG AATACGGCGT AGAAGCCGTG TTCGACAGCG CATCCATCTG 
1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCTGAATTTG 

14 51 AAAAAGCCAA CGCGGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 
1501 TACCTCGCCC CCAACCGCGT GAATTTGGGA CTCACGCAAG AACGTTGGCC 
1551 GGACATCGTG TTCCACGAAA CACGCGAACA TTCGGTCAAA CTGTAA 

This corresponds to the amino acid sequence <SEQ ID 514; ORF 134>: 
ml34 .pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWME IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VI DAAKGVE A QTI KLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVENI LKIRCAPVTW P I GMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQL.RDEIE LVQAASNE FN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL. IDWAPAPKPR DATVRMVEPD EPKFSGFIFK 

3 01 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 
351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 
451 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGN LAI DAGGNLA 

5 01 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ORF 134 shows 98.7% identity over a 531 aa overlap with a predicted ORF (ORF 134.ng) 

from N. gonorrhoeae: 
ml34/gl34 

ml34 .pep 
gl34 



10 20 30 40 50 60 

MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 

1 1 1 1 i 1 1 1 1 1 f i i 1 1 1 1 1 ! 1 1 L I [ 1 1 1 1 1 1 1 1 1 1 1 1 1 ] i 1 1 1 ! 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 = 

MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml3 4 . pep IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 

11 1 I 1 1 1 I 1 1 f 1 I! I I i 1 1 I I 1 1 I 1 1 ! I I I I I I I I ! 1 ) 1 1 1 1 I I I I 

g 1 3 4 IE KQRGI SVASS VMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVI DAAKGVE A 

70 80 90 100 110 120 

130 140 150 160 170 180 
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ml3 4 pep QTIKLLNVCRLRDTPIVTFMNKYDREVRDSLELLDEVENILKIRCAPVTWPIGMGKNFKG 

i 1 1 1 I 1 1 I i 1 1 1 I II 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 M 1 1 1 M = 1 1 = I M I [ 1 1 M 1 I I I I II I I 

gl34 QTIKLLNVCRLRDTPIVTFMNKYDREVRDSLELLDEVEDILQIRCAPVTWPIGMGKNFKG 

130 140 150 160 170 180 

190 200 210 220 230 . 240 

ml34 pep VYHILNDEIYLFEAGGERLPHEFDIIKGIDNPELEQRFPLEIQQLRDEIELVQAASNEFN 

M i I 1 1 I M 1 1 M II 1 1 1 M M 11 II 11 I H M 1 1 M 1 1 1 1 1 M 1 I 1 1 1 1 1 1 1 M N 1 1 1 

gl34 VYHILNDEIYLFEAGGERLPHEFDIIKGINNPELEQRFPLEIQQLRDEIELVQAASNEFN 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml34 pep LDEFLAGELTPVFFGSAINNFGIOEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 

MIIIIMIMMIIIIMIMMIIMIIMIIIIIIMMhMI 1 1 1 1 1 1 1 1 1 1 M 

gl34 LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATMRMVGPDEPKFSGFIFK 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml34 pep IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 

E 1 1 1 1 1 1 1 1 1 M 1 1 1 i 1 1 1 ] 1 1 1 1 ! 1 1 M 1 1 1 ! I 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 = 1 1 f 1 1 I 

gl34 iQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELAEEAYAG 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml34 pep DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

IMIIIMMIIIIMIIIMIIIIIIIIIIIIIMIIIMMIIMIIIIIIIMIMI 

gl34 DiiGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml34 pep GAVQVFKPMSGADLI LGAVGVLQFEWTSRLANEYGVEAVFDSAS I WSARWVSCDDKKKL 

MM I MM I MM M I M M I 1 1 M II MM Ml M MM M M I M M M I M M M 

gl34 GAVQVFKPMSGADLI LGAVGVLQFEWTSRLANEYGVEAVFDSAS I WS ARWVS CDDKKKL 

430 440 450 460 470 480 

490 500 510 520 530 

ml 3 4 pep AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 

! 1 1 , 1 1 INI MM II MM I I I M M M M II M III M II I i I I I M i 

AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 
490 500 510 520 530 



gl34 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 515>: 

al34 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAACTCTTGC 

101 TGTTTTCAGG TGCGATTCAA AGCGCGGGTA CGGTAAAAGG CAAGAAAACC 

151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC AGCGCGGCAT 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTATAAAGAC CACACCGTCA 

251 ACCTTTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

301 GTTTTGACCG CCGTCGATAG TGCCTTGATG GTCATCGACG CGGCAAAAGG 

351 CGTGGAAGCG C AAAC CAT C A AACTCTTGAA CGTCTGCCGC CTGCGCAATA 

4 01 CGCCGATTGT TACGTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCC 

4 51 CTGGAATTGC TGGACGAAGT GGAAAACATC CTGCAAATCC GCTGCGCGCC 

501 CGTAACCTGG CCGATCGGCA TGGGCAAAAA CTTCAAAGGC GTGTACCACA 

551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCTTGCCG 

601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCCGAAT TGGAACAACG 

651 CTTTCCGTTA GAAATACAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 

7 01 CGGCTTCCAA CGAGTTCAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 

7 51 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC AGGAAATCCT 

801 CAATTCATTG ATTGAATGGG CGCCCGCGCC GAAACCACGC GATGCGACCG 

851 TGCGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT TATCTTCAAA 

901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 
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951 CGTCTGCTCC GGCAAATTCG 

1001 TCAACCGCGA AATCGCCGCC 

1051 CGCGAGCTGG TTGAAGAAGC 

1101 CCACGGCAAC ATCCAAATCG 

1151 CGTTTACCGG CATCCCATTC 

1201 ATCAAAAACC CGCTGAAAAT 

12 51 TGGCGAAGAA GGTGCGGTGC 

1301 TGATTTTGGG CGCGGTCGGC 

1351 CTTGCCAACG AATACGGCGT 

14 01 GTCGGCGCGC TGGGTATCGT 

14 51 AAAAAGCCAA CGCGGGCAAC 

1501 TACCTCGCCC CTAACCGCGT 

1551 GGACATCGTG TTCCACGAAA 



AGCGCGGCAT GAAAATGAAA CACCTGCGTA 
TCCAGCGTGG TAACCTTCAT GTCCCACGAC 
CTACGCCGGC GACATTATCG GTATCCCAAA 
GCGACAGCTT CTCCGAAGGC GAACAACTGA 
TTCGCGCCCG AACTGTTCCG CAGCGTTCGC 
CAAGCAACTG CAAAAAGGTT TGCAACAGCT 
AGGTGTTCAA ACCAATGAGC GGCGCGGATT 
GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 
GGAAGCCGTG TTCGACAACG CATCCATCTG 
GCGACGACAA GAAAAAACTG GCGGAATTTG 
CTCGCCATCG ACGCGGGCGG CAACCTCGCC 
GAATCTGGGA CTCACGCAAG AACGCTGGCC 
CGCGCGAGCA TTCGGTCAAA CTTTAA 



lis corresponds to the amino acid sequence <SEQ ID 516; ORF 134.a>: 

al34 .pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM V I DAAKGVE A QTIKLLNVCR LRNTPIVTFM NKYDREVRDS 

151 LELLDEVENI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IEWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLT FTGIPF FAPELFRSVR 

401 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 

4 51 LANEYGVEAV FDNASIWSAR WVSCDDKKKL AEFEKANAGN LAI DAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

1 34/al 34 98.9% identity in 53 1 aa overlap 

10 20 30 40 50 60 

ml 3 4 .pep MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 
| | | M I | I M I i It I I I I I II I I I I I I I I I M M II I I I M I It i I M II I I It I I I I I : 
a 134 MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml3 4 .pep IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 
| M t I I i M t I M I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
a 134 IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml34 . pep QTIKLLNVCRLRDTPIVTFMNKYDREVRDSLELLDEVENILKIRCAPVTWPIGMGKNFKG 
I | | | | | I | | | I I : II I I I I I I I I I I I I I I II II I I I I I I I I : I I I I I I I M I I I I M I I I 
a 1 3 4 QT I KLLNVCRLRNT P I VT FMNKY DREVRDS LELLDE VEN I LQ I RC APVTW P I GMGKN FKG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml34 .pep VYHILNDEIYLFEAGGERLPHEFDIIKGIDN PELEQRFPLEIQQLRDEIELVQAASNEFN 

M ! | I I I M I I I I I I I I II M I I I I I I I I I M I I I I II I I I I I I I I I I I I M I I I I I II ! 
a 134 VYHILNDEIYLFEAGGERLPHEFDIIKGIDNPELEQRFPLEIQQLRDEIELVQAASNEFN 

190 2 00 210 220 230 24 0 

250 2 60 27 0 280 2 90 300 

ml3 4 .pep LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 
i I I I M I I I I I I I I I I I II I I I I I I M M I I : I I I I I I I I I I I I I I I I I I I I I I I M i II 
a 134 LDEFLAGELTPVFFGSAINNFGIQEILNSLIEWAPAPKPRDATVRMVEPDE PKFSGFIFK 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml34 . pep IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 
I I I I I I I I I II M I I II I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I II I I I M I I I 
al34 IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSVVTFMSHDRELVEEAYAG 

310 . 320 330 340 350 360 
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370 380 390 400 410 420 

ml34 .pep DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 
I I { I J M II ! I i I I I I I I ! M I I : I I I I II I M I ! I I M I I I ! I I I 1 I 1 I I I I I I I M I I 
al34 DIIGIPNHGNIQIGDSFSEGEQLTFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 3 4 .pep GAVQVFKPMSGADLILGAVGVLQFEWTSRLANEYGVEAVFDSASIWSARWVSCDDKKKL 
I I I I I M i I I I I I I I I M I I I I I I 1 M I I I t I I I II I I I I I I : i I II II I I II I I I II I I 
a!34 GAVQVFKPMSGADLILGAVGVLQFEVVTSRLANEYGVEAVFDNASIWSARWVSCDDKKKL 

430 440 450 460 470 480 

490 500 510 520 530 

ml 3 4 . pep AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 
I It I I M II M I I II I I 1 I I M I I I M 11 II I I I i II I I I I 1 I II M I I I I I 
al34 AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 

490 500 510 520 530 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 517>: 

gl35 . seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCG 

51 TTCGGAcgGC AGCCTCTCGC GCGGCAAAAT CCAAACCATC ACCCGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCTG CAGGGTTCGG CGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGChGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCACAAATC 

3 01 CTGCTCAGCC GTGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

3 51 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCGATTCCC ATCATCAATG 

4 01 AAAACGACAC GGTTTCGGTT GAGGAGTTGA AAATCGGCGA CAACGACACA 
451 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 
501 GACCGACATA GACGGTCTTT ACACCGGCAA CCCGAACAGC AATCCCGATG 
551 CCGTACGGCT GGACAAAATC GAACACATCA AC CAT G AAAT CATCGAAATG 
601 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 
651 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG TATATCTGTT 
701 CCTCACTCAA ACCCGATTCA TTGGCCGAAG CCGCCGAACA TCAGGCGGAC 
751 GGCTCGTTTT TCGTcccCcg tgCCAAAGGT TTGCGGACAC AGAAGCAATG 
801 GctggCGTTC TATTCcgaaa gcggGGgcag cgttTAtgtg gacgaaagtg 
851 cggaacacgc tTtgtccgaa caagggaaag cctgCTGA 

This corresponds to the amino acid sequence <SEQ ID 518; ORF 135.ng>: 

gl35 .pep 

1 MKYKRIVFKV GTSSITRSDG SLSRGKIQTI TRQLAALHHA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAIP I INENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIAAESGVPV YICSSLKPDS LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESGGSVYV DESAEHALSE QGKAC* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 519>: 

ml35 . seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCA 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC ACCTGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCGG CAGGGTTCGG TGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCGCAAATC 

3 01 CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

3 51 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCCGTCCCC ATCATCAATG 
401 AAAACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGACACA 

4 51 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 
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501 G AC CG AC AT A GACGGTCTTT ACACGGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

601 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 

651 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG TATATCTGTT 

701 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CTGCCGAACA TCAGGCGGAC 

751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AGAAGCAATG 

801 GCTGGCGTTC T ATT CCGAAA GCCGGGGCAG CGTTTATGTG GACGAAGGTG 

851 CGGAACACGC TTTGTCCGAA CAGGGGAAAA GCCTGCTGAT GTCGGGCATT 

901 GCCGGAAT CG AAGGGCATTT TTCCCGTATG GACACCGTAA CCGTGTACAG 

951 CAAGGCAACC AAACAGCCCC TGGGCAAAGG GCGCGTCCTG TTCGGCTCTG 

1001 CCGCCGCCGA AGACCTGCTC AAATCGCGTA AGGCGAAAGG CGTGTTCATC 

1051 CATCGGGACG ACTGGATTTC CATCACGCCC GAAATACGCC TGCTTCTGAC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 520; ORF 135>: 

ml35 .pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TCQLAALHHA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKI GDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIAAESGVPV YICSSLKPDA LAEAAEHQAD 

2 51 GSFFVPRAKG LRTQKQWLAF YSESRGSVYV DEGAEHALSE QGKSLLMSGI 

3 01 AGIEGHFSRM DTVTVYS KAT KQPLGKGRVL FGSAAAEDLL KSRKAKGVFI 
351 HRDDWISITP EIRLLLTEF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 135 shows 97.6% identity over a 294 aa overlap with a predicted ORF (ORF 135.ng) 
from N. gonorrhoeae: 

ml35/gl35 

10 20 30 40 50 60 

ml35 pep MKYKRIVFKVGTSSITHSDGSLSRGKIQTITCQIoAALHHAGHELVLVSSGAVAAGFGALG 

MM MMI MIIMMI! MIMM I M M IIIIIIIIIIIIIIIIIMIIIIIIIM 

g i35 MKYKRIVFKVGTSSITRSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 3 5 . pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

Ml I IN II I III Ml Ml i Ml I II I M 1 1 II I Ml III III Mil HI lull MM I 

g 1 3 5 FKKRPVKI ADKQAS AAVGQGLLMEE YTANLS SDGIVSAQI LLSRADFADKRRYQNAGGAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 3 5 . pep SVLLQRRAVP 1 1 NEND TVS VEELKI GDNDTLS AQVAAMI QADLLVLLTD I DGLYTGNPNS 

I I I I 1 1 I I : < I I II M I I ! ! M I I I I II I I I ! M M : I I I 1 1 I I I I I M ! M I I ! I I I ! I 

g i3 5 SVLLQRRAIPI I NENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDI DGLYTGNPNS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml3 5 .pep NPDAVRLDKI EH I NHE 1 1 EMAGGSGSANGTGGMLTKI KAAT I AAESGVPV YICSSLKPDA 

Ml I IMIIIIMI Ml 1 1 1 1 1 I I 1 1 I 1 1 1 1 1 1 I 11 II I f 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 = 

gl3 5 NPDAVRLDKI EH I NHE 1 1 EMAGGSGSANGTGGMLTKI KAAT I AAESGVPV YI CSSLKPDS 

190 200 210 220 230 240 

25 0 260 270 280 2 90 300 

ml 3 5 . pep LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 

MM II MM I Ml IM Ml Ml MIMM MM 1 1 1 1 II M I 1 1 M I II I M 

gl3 5 LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESGGSVYVDESAEHALSEQGKACX 

250 260 270 280 290 

310 320 330 340 350 360 
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ml35 .pep 



AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 



The following partial DNA sequence was 

al35. seq 

ATGAAATACA AAAGAATCGT 
TTCGGACGGC AGTCTCTCGC 
TTGCCGCATT GCATCATGCG 
GCGGTTGCGG CAGGGTTCGG 
AATCGCCGAC AAACAGGCTT 
AAGAATATAC GGCAAACCTG 
CTGCTCAGCC GCGCCGACTT 
CGGCGCACTT TCCGTGCTGC 
AAAACGATAC GGTTTCGGTT 
TTGAGTGCGC AAGTGGCGGC 
GACCGACATA GACGGTCTTT 
CCGTACGGCT GGACAAAATC 
GCGGGCGGCT CGGGTTCGGC 
CAAAGCGGCG ACGATTGCGA 
CCTCGCTCAA ACCCGATGCA 
GGCTCGTTTT TCGTCCCCCG 
GCTGGCGTTC TATTCCGAAA 
CGGAACACGC TTTGTCCGAA 
GCCGGAATCG AAGGGCATTT 
CAAGGCAACC AAACAGCCTT 
CCGCCGCCGA AGACCTGCTC 
CATCGGGACG ACTGGATTTC 
CGAATTTTAG 



identified in N. meningitidis <SEQ ID 52 1>: 



l 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 



ATTTAAAGTC 
GCGGCAAAAT 
GGACACGAGC 
TGCGCTGGGT 
CCGCCGCCGT 
TCTTCAGACG 
TGCCGACAAA 
TGCAACGCCG 
GAGGAATTGA 
GATGATACAG 
ACACCGGCAA 
GAACACATCA 
AAACGGCACA 
CCGAGTCCGG 
CTTGCCGAAG 
TGCCAAAGGT 
GCAGGGGCGG 
CAGGGAAAAA 
TTCCCGTATG 
TGGGCAAAGG 
AAATTGCGTA 
CATCACGCCC 



GGCACATCTT 
CCAAACCATC 
TGGTCTTGGT 
TTCAAAAAAC 
CGGGCAGGGG 
GCATCGTGTC 
CGCCGCTACC 
CGCCGTCCCC 
AAATCGGCGA 
GCAGACCTCT 
CCCGAACAGC 
AC CAT G AAAT 
GGCGGTATGC 
CGTACCGGTC 
CGGCAGATAA 
TTGCGGACGC 
CGTTTATGTG 
GCCTGCTGAT 
GACACCGTAA 
GCGAGTCCTG 
AGGCGAAAGG 
GAAATACGCC 



CGATTACCCA 
ACCCGCCAGC 
GTCTTCCGGC 
GTCCGGTCAA 
CTGCTGATGG 
CGCACAAATC 
AAAATGCCGG 
ATCATCAATG 
CAACGACACA 
TGGTGCTGCT 
AATCCCGATG 
CAT C G AAAT G 
TGACTAAAAT 
TATATCTGTT 
TCAGGCGGAC 
AGAAGCAATG 
GACGAAGGTG 
GTCGGGCATT 
CCGTGTACAG 
TTCGGCTCTG 
CGTGTTCATC 
TGCTTCTGAC 



This corresponds to the amino acid sequence <SEQ ID 522; ORF 135.a>: 

al35 .pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TRQLAALHHA 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI 

201 AGGSGSANGT GGMLTKIKAA TIATESGVPV YICSSLKPDA 

251 GSFFVPRAKG LRTQKQWLAF YSESRGGVYV DEGAEHALSE 

301 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL 

351 HRDDWISITP EIRLLLTEF* 



GHE LVLVSSG 
SSDGIVSAQI 
EELKIGDNDT 
EHINHEIIEM 
LAEAADNQAD 
QGKSLLMSGI 
KLRKAKGVFI 



ml35/al35 98.4% identity in 369 aa overlap 

10 20 



30 



40 



50 



60 



m!35 . pep MKYKRIVFKVGTSSITHSDGSLSRGKIQTITCQLAALHHAGHELVLVSSGAVAAGFGALG 
I I I M M I II II I I I I I 1 1 I I M I f I II I I I I i I I I I I II I I. I I I I I I I I I I I I I I I I I 
al35 MKYKRIVFECVGTSSITHSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml35 . pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 
I M I I I I I I I M II I I I I I I I I M I I I I I II II I I I I I I I I I M I I II I I I I I I I I I I I I 
al35 FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml35.pep S VLLQRRAVP I INENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDI DGLYTGNPNS 

I I I I I I I 1 I I I I I 1 I I I IE I M II I I I I I I I I I I j I I I I M I I I I M I I II 1 I M 1 I I I I 
al35 SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDI DGLYTGNPNS 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 3 5. pep NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIAAESGVPVYICSSLKPDA 
I I I I I i 1 I I I I I I I I M I I I I I I I I I I I M I I I I I I I M I I I I : M I I I I I I I I I I II I I 
al35 NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIATESGV PVYICSSLKPDA 

190 200 210 220 230 240 
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250 260 270 280 290 300 

ml 3 5 .pep LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 
I I I I I : : II I I I M I I I I M I I I I I ! I I I I I I I I I ( : M I I I I it I I i I I I I I I 1 I I I I I 
a 13 5 LAEAADNQADGSFFVPRAKGLRTQKQWLAFYSESRGGVYVDEGAEHALSEQGKSLLMSGI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 3 5. pep AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 
I I I I 1 II M I I I I I I I I I I I I I I I I I I I M M I II II M II M I I I I I II I I I I I I I M 
al35 AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKLRPCAKGVFIHRDDWISITP 

310 320 330 340 350 360 

370 

ml 3 5 . pep EI RLLLTE FX 

I II II 11 M I 
,al35 EI RLLLTE FX 

370 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 523>: 

g!36 . seq 

1 ATGGAAATCC GGTTTCAGAC AGCATTTTTA CGTTTGGTTC AGatgaAAAC 

51 AAACGCTtca aTTCTtaccg caACACGCCT TGTATTTCCT GccgCTGCCG 

101 CACGGACAGG GATCGTTCCT GCCGgtTTTT TCCCCTTCCC TGCGGACGGT 

151 TTG CGGTTTG TTGATGACCG CCTGCCAGTA GCGGTAGATG TCtgccagcg 

201 cgTAAGGCag tTCGGAcgca agttccgcca gctcgccttc ggTGAATTGC 

251 AGgcggataa cgccgtttTC CTCTTCGTCg taaatgccgc ccactgccat 

3 01 cacgGGGTAA AACAGCTCTT CAAACGCTTC ATCATCGGCG GCTTCAAACC 
351 AATCGGTCGG CACAATGTCC AAAC CGTAAA GATAGGCGTT GCACCAAGTG 

4 01 TAAAAATCGC TGCCGCCCTC GCCGTCGTCG TAGAGCCACA AATCGGGCAG 
4 51 CTTTTTATCC GACATCGCGG CGGTTGTTTC CATCGCCATT GCCAAAACCA 
501 GCCGTTCGAT TTCGGAACGT TCGGCGGCGG TAAATTGCGA TTCGTCGCCC 
551 AACACTTCGG GCAGCCAGTC GAGCGGTGCC AATTTGTCCG GCCCGCTCAA 
601 CAGCGCCGTC ATAAAACCTT GAACCTCGTC GCAACGCATC GTGTTGCCTT 
651 GTTCGCTTTT GGCATCCAAT AA 

This corresponds to the amino acid sequence <SEQ ID 524; ORF 136.ng>: 

gl36 .pep 

1 MEIRFQTAFL RLVQMKTNAS ILTATRLVFP AAAARTGIVP AGFFPFPADG 

51 LRFVDDRLPV AVDVCQRVRQ FGRKFRQLAF GELQADNAVF LFWNAAHCH 

101 HGVKQLFKRF IIGGFKPIGR HNVQTV KIGV APSVKIAAAL AWV EPQIGQ 

151 LFIRHRGGCF HRHCQNQPFD FGTFGGGKLR FVAQHFGQPV ERCQFVRPAQ 

201 QRRHKTLNLV ATHRVALFAF GIQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 525>: 

ml36 . seq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGTTTGTT GATGACTGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTC C AG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGCG CCGTTTTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

3 01 TT C AAAC CAA TCGGTCGGCA CAATATCCAA AC CGT AAAGA TAAGCATTGC 
351 AC C ATGTGT A AAAATCGCTG CCGCCGTCTT CGTTTTCATA C AG C C AC AAA 

4 01 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 
451 CAAAACCAGC CGTT CGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 
501 CGTCGCCCAA CACTTCGGGC AG CCAGTCGA GCGGTGTCAA TTTGTCCGGC 
551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 
601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 
651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CAAATGGGTT 
701 TTGCGCCCTA TTATCGCCGC AATGCCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 526; ORF 136>: 
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ml3 6 .pep 

1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRFV DDCLPVAVDI 

51 RQCIRQLGFQ FRQLAFCELQ TDSAVFLFW NTAQCHDGIK QLFKRF1IDG 

101 FKPIGRHNIQ TV KISIAPCV KIAAAVFV FI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFG I QQ FAQPPFGCFG KFSGIHHFPF QMGFAPYYRR NAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 136 shows 85.6% identity over a 209 aa overlap with a predicted ORF (ORF 136.ng) 
from N. gonorrhoeae: 

ml36/gl36 

10 20 30 40 

ml 3 6 oec METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPV 

* P hMMMMIMM Ml I II MMIMIIII Ml 

gl36 MEIRFQTAFLRLVQMKTNASILTATRLVFPAAAARTGIVPAGFFPFPADGLRFVDDRLPV 

10 20 30 40 50 60 

50 60 70 80 90 100 

ml3 6 net) AVDIRQCIRQLGFQFRQLAFCELQTDSAVFLFWNTAQCHDGIKQLFKRFIIDGFKPIGR 

IN: I HIM MIMM 1 1 1 = I = II M 1 1 1 h h 1 1 MINIUM IIIMM 

gl36 AVDVCQRVRQFGRKFRQLAFGELQADNAVFLFWNAAHCHHGVKQLFKRFIIGGFKPIGR 

70 80 90 100 110 120 

HO 120 130 140 150 160 

ml36 oeo HNIQTVKISIAPCVKIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLR* 

MM Ml MM I 111111= I = = I M 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M I N 1 1 1 

HNVQTVKIGVAPSVKIAAALAVWEPQIGQLFIRHRGGCFHRHCQNQPFDFGTFGGGKLR 
130 140 150 160 170 180 



9136 



170 180 190 200 210 220 

ml3 6 pep FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIH 

M M 1 1 1 1 1 II I II M 1 1 1 1 M 1 1 1 1 M 1 1 1 1 II I M 1 1 1 1 1 1 

gl3 6 FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGIQX 

190 200 210 220 

230 240 
ml 3 6 . pep HFPFQMGFAPYYRRNAVX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 527>: 

al36 . seq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGCTTGTT GATGACCGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTCCAG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGTG CCGTTGTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAGCATTGC 

351 ACCATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAGCCACAAA 

401 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 

4 51 CAAAACCAGC CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 

501 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCAA TTTGTCCGGC 

551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 

601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 

651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CCAATGGGTT 

701 TTGCGCCCTA TTATAGTGGA TTAAATTTAA ATCAGGACAA GGCGACGAAG 

7 51 CCGCAGACAG TACAAATAGT ACGGCAAGGC GAGGCAACGC CGTACTGGTT 

801 TAAATTTAAT CCACTATATC GCCGCAATGC CGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 528; ORF 136.a>: 

al36 .pep 
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1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRLV DDRLPVAVDI 

51 RQCIRQLGFQ FRO LAFCELQ TDSAVVLFVV NTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TVK ISIAPCV KIAAAVFVFI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF PMGFAPYYSG LNLNQDKATK 

251 PQTVQIVRQG EATPYWFKFN PLYRRNAV* 

ml36/al36 98.3% identity in 238 aa overlap 

10 20 30 40 50 60 

ml36 pep METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPVAVDIRQCIRQLGFQ 
[ I M I I I I I M i I I M ! I I I M I I It I I I I II It II I I : I M I I I I I I I I I I M M I I I 
al36 METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRLVDDRLPVAVDIRQCIRQLGFQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml36 pep FRQLAFCELQTDSAVFLFVVNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 
I n I I I I If I M I I 1 M M I I I I M M M I I I M I I I I I I I I I I I I II I I I I I I I If I I 
al36 FRQLAFCELQTDSAWLFVVNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 3 6 pep KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 

| | | M I I I I I M ) | [ M I I I M I I I I I I I M I I I I I I M I I I I I II I I I I I I II 

al36 KIAAAVFVFI QPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml36 pep FVR PAQQRRHKT LNLVATHRVALFAFG I QQ FAQ PPFGCFGKFSG I HHFPFQMGFAPYYRR 

| I | I I I I M I I I I I E I I I M II I I M I I M I I M I I I I II I I I I! I M I I I I I I I I I 
al36 FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFPMGFAPYYSG 

190 200 210 220 230 240 

ml 3 6. pep NAVX 

al3 6 LNLNQDKATKPQTVQIVRQGEATPYWFKFNPLYRRNAVX 

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 529>: 

gl37 . seq 

1 ATGATTATCC ATCACcaaTT CGATCCCGTC CTCATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCT TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TGATTTTGGG 

2 01 CGGACGCTTG GGCTATGTCC TGTTTTACAA ATTCTCCGAC TACCTCGCCC 
251 ATCCGCTTGA TAT TTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

3 01 GGCTTTTTGG GTGTAGTTAT TGCCATATGG TTGTTCAGCC GCAAGCACGG 
351 CATCGGCTTC CTCAAACTGA TGGACACGGT CGCGCCGCTC GTTCCGCTGG 

4 01 GTCTCGCTTC GGGACGTATC GG CAACTTT A TCAACGGCGA ACTTTGGGGA 
451 CGCATTACCG ACATTAACGC ATTTTGGGCA ATGGGCTTCC CGCAAGCGCA 
501 TTACGAAGAT GCCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 
551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 
601 GCCCTTGAAG GCATCTGCCT GTTCGCCGTC GTTTGGCTGT TTTCCAAAAA 
651 ACCGCGCCCG ACCGGGCAGA CTGCCGCGCT TTTTCTCGGC GGCTACGGCG 
701 TGTTCCGCTT TATTGCCGAA TTTGCGCGCC AACCCGACGA CTATCTCGGG 
751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 
801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 
851 GA 

This corresponds to the amino acid sequence <SEQ ID 530; ORF 137.ng>: 

gl37 .pep 
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1 MIIHHQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGGMSFHG 

101 GFLGWIAIW LF SRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RITDINAFWA MGFPQAHYED AEAAAHNPLW AEWLQQYGML PRHPSQ LYQF 

201 ALEGICLFAV VWL FSKKPRP TGQTAALFLG GYGVFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 53 1>: 

ml3 7 . seq 

1 ATGATTACCC 

51 TGCCGTCCGC 

101 CCTTTCTCGG 

151 GAATCGCTCG 

2 01 CGGGCGTTTG 

2 51 ATCCGCTTGA 

3 01 GGCTTTTTGG 
351 CATCGGCTTC 

4 01 GTCTCGCTTC 

4 51 CGCGTTACCG 

5 01 TTACGAAGAT 
551 TGCAACAATA 
601 GCACTTGAAG 

6 51 ACAGCGGTCG 

7 01 TATTCCGCTT 

7 51 CTGCTGACCT 

8 01 TGTTTTGGGT 
851 GA 

This corresponds to the amino acid sequence <SEQ ID 532; ORF 137>: 

m!3 7.pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGG MSFHG 

101 GFLGWIAIR LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDINAFWA MGFPQARYED AEAAAHNPLW AEWLQQYGML PRHPS QLYQF 

2 01 ALEGICLFTV IWL FSKKQRS TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 137 shows 95.4% identity over a 283 aa overlap with a predicted ORF (ORF 137.ng) 
from TV. gonorrhoeae: 

ml37/gl37 

10 20 30 40 50 60 

MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

II I I I I I i M ! I II !! I MM I M: I I I I I M II I MM II i I i I Mill I M M | M 

Mil HHQ FD P VL I S I G P LAVRW YALS Y I LG F I L FT FLGRRR I AQG L S VF T KE S LDD F LTWG 
10 20 30 40 50 60 

70 80 90 100 110 120 

ILGVI LGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIRLFGRKHGIGF 

I 1 1 1 ! 1 1 1 1 E 1 1 1 1 1 1 1 1 1 M I 1 1 M M I M I I M M M M 1 11 1 M 1 1 MMIIMM 

ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIWLFSRKHGIGF 
70 80 90 100 110 120 

130 140 150 160 170 180 

LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 

M I M M M I MM MM 1 1 MM MMM MM Ml MM Ml I Mi MM M M III I 

LKLMDTVAPLVPLGLASGRIGNFINGELWGRITDINAFWAMGFPQAHYEDAEAAAHNPLW 
130 140 150 160 170 180 

190 200 210 220 230 240 

AEWLQQYGMLPRHPSQLYQFALEGI CLFTVI WLFSKKQRSTGQVASLFLGGYGI FRFIAE 



ATCCCCAATT CGATCCCGTC 
TGGTATGCCC TAAGCTACAT 
CAGAAGGCGC ATCGCGCAAG 
ACGACTTCCT GACATGGGGC 
GGTTACGTCC TGTTTTACAA 
TATTTTCAAG GTATGGGAAG 
GTGTAGTTAT TGCCATACGG 
CTCAAACTGA TGGATACGGT 
GGGACGTATC GGCAACTTCA 
ACATCAACGC ATTTTGGGCA 
GCCGAAGCCG CCGCGCACAA 
CGGTATGCTG CCGCGTCATC 
GCATCTGCCT GTTCACCGTC 
ACCGGACAAG TCGCCTCGCT 
CAT TG CCGAA TTCGCACGCC 
TGGGGCTGTC GATGGGGCAA 
ATCGTCGGCT TTGTCCGGTT 



CTTATCAGTA TCGGCCCGCT 
CCTCGGATTT ATTCTTTTTA 
GCTTGTCCGT TTTTACCAAA 
ATTTTGGGCG TAATTTTGGG 
GTTTTCCGAC TACCTCGCCC 
GCGGAATGTC GTTCCACGGC 
TTGTTCGGCC GCAAACACGG 
CGCACCGCTC GTTCCGCTGG 
TCAACGGCGA ACTTTGGGGA 
ATGGGCTTCC CGCAGGCGCG 
TCCGCTTTGG GCAGAATGGC 
CCTCGCAGCT TTATCAGTTT 
ATTTGGCTGT TCTCTAAAAA 
CTTCCTCGGC GGCTACGGCA 
AACCCGACGA CTATCTCGGG 
TGGTTGAGCG TCCCGATGAT 
CGGCATGAAA AAACAGCACT 



m!3 7 .pep 
gl37 

ml37 .pep 
gl37 

ml37 .pep 
gl37 

ml37 .pep 
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1 I I II I ! ' I MM I II II I I II M I ; 1 MM I I i I | I I -■ I : I I I I I I h , I l I I! 

gl37 AEWLQQYGMLPRHPSQLYQFALEGICLFAWWLFSKKPRPTGQTAALFLGGYGVFRFIAE 

190 200 210 220 230 240 

250 260 270 280 

ml 3 7 pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

! II IN llll MM II II I II MINI I II MUIII II I I I II 

g 13 7 FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 533>: 

al37.seq 1 

1 ATGATTACCC ATCCCCAATT CGACCCCGTC CTTATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCC TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TAATTTTGGG 

201 CGGGCGTTTG GGTTACGTCC TGTTTTACAA GTTTTCCGAC TACCTCGCCC 

251 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATATGG TTGTTCGGTC GCAAACACGG 

351 CATCGGCTTC CTCAAACTGA TGGACACGGT CGCACCGCTC GTTCCACTGG 

4 01 GTCTCGCTTC GGGACGTATC GGCAACTTCA TCAACGGCGA ACTTTGGGGA 

4 51 CGCGTTACCG ACATCAACGC ATTTTGGGCA ATGGGCTTCC CGCAGGCGCG 

501 TTACGAAGAC CTCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 

551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 

601 GCACTTGAAG GCATCTGCCT GTTCGCCGTC GTTTGGCTGT TCTCTAAAAA 

651 ACAGCGGCCG ACCGGACAAG TCGCCTCACT CTTCCTCGGC GGCTACGGCA 

701 TATTCCGCTT CATTGCCGAA TTTGCACGCC AACCCGACGA CTATCTCGGG 

751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG T CC CG AT GAT 

801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 

8 51 GA 

This corresponds to the amino acid sequence <SEQ ID 534; ORF 137.a>: 

al37.pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVTLGGRL GYVLFYKFSD YLAHPLDIFK VWEGGMSFHG 

101 GFLGWIAIW LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDINAFWA MGFPQARYED LEAAAHNPLW AEWLQQYGML PRHPSQLYQF 

201 ALEGICLFAV VWL FSKKQRP TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

ml37/al37 98.2% identity in 283 aa overlap 

10 20 30 40 50 60 

ml37 .pep MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
| | | | | | | | | M I I I I I I I II I I I I I I I I I I I I M I M I I I I I II I I I I I I I t I I I I 1 I I I 
a 137 MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml37 .pep ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIRLFGRKHGIGF 
M | I I II I I I I I I I II I I I I I I I 1 I M I I I I M M I I I I I I II I M I II t I I II I I 11 1 
a 137 I LGVI LGGRLGYVLFYKFS DYLAHPLDI FKVWEGGMS FHGG FLGWI AI WLFGRKHGIGF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml37 . pep LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 
I I I I M I II I I I I M I I I I I I I I I I I I I II I 1 I II I I I I I I I I I I 1 I I I I I I I I 1 II I I 
al37 LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDLEAAAHNPLW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml37 . pep AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 
I I I I I I I I I I I I I I I I I I I I I I I I M I I : I : I I I I I I I I 1 M I I I I I I I I 11 I I I I I II 
al37 AEWLQQYGML PRH PS QLYQFALEG I CLFAVVWLFSKKQRPTGQVASLFLGGYG I FRFIAE 
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190 200 210 220 230 240 

250 260 270 280 

ml 37 . pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 
t I I t I I I I M M I 1 I I I I I I I I I I I I I I I I I I i I I I I I I I I I M 
al37 FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 535>: 
gl38 . seq 

1 ATGGAGTTTG AAAACATTAT TTCCGCCGCc gaCAAGGCGC GTATCCTTGC 

51 CGAAGCACTG CCTTACAtcc gccgGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AGTATGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GCGAATTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGAC GATGGATATT 

301 GTCGAAATGG TATTGGGCGG GCACGTCAAC AAGGAAATCG TGTCGATGAT 

3 51 TAACACATAT GGAGGGCACG CGGTCGGCGT GAGCGGGCGC GACGACCATT 
401 TCATTAAGGC GAAGAAACTT TTGGTCGATA CGCCCGAACA GAATAGCGTG 
451 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 
501 AGGGCTGATA GAACGCGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 
551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT GGCAGGCAAA 
601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAAtatcgc 
651 cgGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC acgCCGAAAC 
701 GGATTGATGG GCTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 
751 AAAATCGCTT CTGCGGTCGA AGCcgccgtc aACGGTGTGA AAGCCACGCA 
801 CATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 
851 ATGCCGGTAT CGGGTCGATG ATTTTAGGCA GAGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 536; ORF 138,ng>: 

g!38.pep 

1 MEFENXISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 

51 RDWLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKETMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFI KAKKL LVDTPEQNSV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF N I NADLVAG K 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDGLIA DGTLYGGMLP 

251 KI AS AVE AAV NGVKATHI I D GRLP NALLLE IFTDAGIGSM IL GRGEDA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 537>: 

tal3 8 . seq 

1 ATGGAGT CTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

4 01 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

4 51 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

5 01 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 
551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

6 01 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 
651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 
701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 
751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGTGTGA AAGCCACGCA 
801 TATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 
851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 538; ORF 138>: 
ml3 8 -pep 

1 MESENIISAA DKARILAEAL, PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 
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51 RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDELIA DGTLYGGMLP 

251 KIASAVEAAV NGVKATHI ID GRLPN ALLLE IFTDAGIGSM I L GGGEDA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 138 shows 98.0% identity over a 298 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae: 

ml38/gl38 

10 20 30 40 50 60 

ml 3 8 pep ME SEN 1 1 SAADKARIIiAEALPYI RRFSGSVAVI KYGGNAMTEPAliKEGFARDWLLKLVG 

II IIIIIMIIMIIIIIIMIIIIIIIIMIMIMIIIIIIIIIIMMIIIIIIII 

gl38 MEFENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDVVLLKLVG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml38 pep IHPVIVHGGGPQINANLEKVGKKGEFVQGI^VTDKEAMDIVEMVLGGHVNKEIVSMINTY 

M 1 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 M ! 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M I II 1 1 1 1 1 

gl38 IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKETMDIVEMVLGGHVNKEIVSMINTY 

70 80 90 100 110 120 



ml3 8 .pep 



130 140 150 160 170 180 

GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVES IDTGLVKGLIERGCI PWAP 

MMM MM II 1 1 1 IMIIMi II 1 1 M MM M I Ml M MM 1 1 M M MM M M M 

gl3 8 GGHAVGVSGRDDHF I KAKKLLVDTPEQNSVD I GQVGTVES IDTGLVKGLI ERGCI PWAP 

130 140 150 160 170 180 



ml3 8 .pep 
gl38 

ml3 8 .pep 
gl38 



190 200 210 220 230 240 

VGVGEKGEAFNINADLVAGKLAEEIJNAEKLL^IMTNIAGVMDKTGNLLTKLTPKRIDELIA 

Ml M II M M M M MMMMM M MMM M M Ml MM M M I 1 1 1 1 ' I M I 1 1 

VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTNIAGVMDKTGNLLTKLTPKRIDGLIA 
190 200 210 220 230 240 

250 260 270 280 290 299 

DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALLLEIFTDAGIGSMILGGGEDAX 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 Mill 

DGTLYGGMLPKI ASAVEAAVNGVKATHI IDGRLPNALLLE I FTDAGIGSMI LGRGEDAX 
250 260 270 280 290 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 539>: 

al38 . seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

4 01 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

4 51 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATAT CGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 

7 01 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGCGTGA AAGCCACGCA 

801 TATCATCGAC GGCAGGGTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 
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This corresponds to the amino acid sequence <SEQ ID 540; ORF 138.a>: 

al38.pep 

1 MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 

51 RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

2 01 LAEELNAEKL LMMTNIAGVM DKTGN LLTKL TPKRIDELIA DG TLYGGMLP 

251 K I AS AVE AAV NGVKATHIID GRVPNALLLE IFTDAGIGSM ILGGGEDA* 

ml 38/al 38 99.7% identity in 298 aa overlap 

10 20 30 40 50 60 

ml 3 8. pep MESENIISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDVVLLKLVG 
I I I I i I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I M I I I I I I I I I I! I I I I I I I I 
al38 ME SEN 1 1 S AADKAR I LAE AL P Y I RR FSG S VAV I K YGGN AMTE PALKEG FAR DVVL LKLVG 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 3 8 . pep IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKEAMDIVEMVLGGHVNKEIVSMINTY 
I I I I I I ! I I I I I I I I I I I I I 1 I I I I It II I I I I I I I I I M I I I I I I I I M M I 1 I I I I I I 
al3 8 IHPVIVHGGGPQINAMLEKVGKKGEFVQGMRVTDKEAMDIVEMVLGGHVNKEIVSMINTY 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml38.pep GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVESIDTGLVKGLIERGCIPWAP 
M I I I I I I I I I I I I I I M I I I I I I I I I I I I I t I I I I I I I 11 M I E I I I M II I I I I II I 1 
al3 8 GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVESIDTGLVKGLIERGCIPWAP 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 3 8 . pep VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTN I AGVMDKTGNLLT KLT PKRI DELIA 

I M I I 1 I I I M I 1 I! I I I I I I I I I I 1 i I I I I M M I II I I M I M I I I I 1 I t I I I I 1 M I 
al38 VGVGEKGEAFN INADLVAGKLAEELNAEKLLMMTN IAGVMDKTGNLLTKLTPKRI DELI A 

190 200 210 220 230 240 



250 260 270 280 290 299 

ml3 8 .pep DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALLLEIFTDAGIGSMILGGGEDAX 
I M I I I I I I II I I I M I I I I I I I I I I I I I I M : I I I I I I M I I I I I M II U I I I I I II 
al38 DGTLYGGMLPKI AS AVEAAVNGVKATHI I DGRVPNALLLE I FTDAGIGSMILGGGEDAX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 54 1>: 

gl39 . seq 

1 ATGCGAACCA CCTCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAggc ggcggcggag 

101 gcGGCACTTC TGCTCCCGAC TTTAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACGATAGC GGAATCAGCA GCAGTATCTT ACGCCGGTAT 

2 01 AAAAAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

2 51 ACGT TGCGGT TACAGACAGG GATGCCAAAA TCAAAGCCCC CCGAATCTGC 

3 01 ATACCGGAGA CTTTTCAAAC CCAAATGACC AATATTAAGA ATATGATCAA 

3 51 CCTCAAACCT GCAATTGAAG CAGGCTATAC AGGACGCGGG GTAGAGGTAG 

4 01 GTATCGTCGA TACAGGCGAA TCCGTCGGCA GCATATCCTT TCCCGAACTG 

4 51 TATGGCAGAA AAGAACACGG CTATAACGAA AATTACAAAA ACAAATTACA 

5 01 AAAACTATAC GGCGTATATG CGGAAGGAAG CGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 542; ORF 138.ng>: 

gl3 9 .pep 

1 MRTTSTFPTK TFKPAAMALA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATIAESA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKIKAPRIC 

101 I PETFQTQMT NIKNMINLKP AIEAGYTGRG VEVGIVDTGE SVGSISFPEL 

151 YGRKEHGYNE NYKNKLQKXiY GVYAEGSA* 
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The following partial DNA sequence was identified in N, meningitidis <SEQ ID 543>: 



ml39 . seq 








1 


ATGCGAACGA 


CCCCAACCTT 


CCCTACAAAA 


51 


GGCGTTAGCT 


GTTGCAACAA 


CACTTTCTGC 


101 


GCGGCACTTC 


TGCGCCCGAC 


TTCAATGCAG 


151 


AACAGCAGAG 


CAACAACAGC 


GAAATCAGCA 


201 


CAAGAACGAA 


ATGTGCAAAG 


ACAGAAGCAT 


251 


ACGTTGCGGT 


TACAGACAGG 


GATGCCAAAA 


301 


TGCATACCGG 


AGACTTTCCA 


AACCCAAATG 


351 


CAACCTCAAA 


CCTGCAATTG 


AAGCAGGCTA 


401 


TAGGTATCGT 


CGACACAGGC 


GAATCCGTCG 


451 


CTGTATGGCA 


GAAAAGAACA 


CGGCTATAAC 


501 


CGGCGTATAT 


GCGGAAGGAA 


GCGCCTGA 



This corresponds to the amino acid sequence <SEQ ID 544; ORF 138>: 

ml3 9 .pep 

1 MRTTFTFPTK TFKPTAMA LtA VATTLSACLG GGGGGTSAPD FNAGGTG I GS 

51 NSRATTAKSA AVS YAGI KNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 

101 CIPETFQTQM THYKNLINLK PAI EAGYTGR GVEVGIVDTG ESVGSISFPE 

151 LYGRKEHGYN ENYEKLYGVY AEGSA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 138 shows 92.2% identity over a 179 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae: 

ml39/gl39 



10 20 30 40 50 60 

ml 3 9 . pep MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPD FNAGGTG I GSNS RATTAKS A 

Mil MMMMhllMMIIIIIIIIIIMIIMIIMMIIIIMIIIIII hll 

gl3 9 MRTTS T F PT KTF KPAAMALAVATTLS ACLGGGGGGTS APD FNAGGTG I G SNS RAT I AE S A 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml3 9 . pep AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 

IMI M I I I I : i ! I ! 1 1 1: ! : II II 1 1 I IN I hi IIIMIIMIIIh Ihllll 

gl3 9 AVS YAGI KNEMCKDRSMLCAGRDDVAVTDRDAKIKAP - RI CI PETFQTQMTNI KNMINLK 

70 80 90 100 110 



130 140 150 160 170 
tnl3 9 . pep PAIEAGYTGRGVEVGI VDTGESVGS I S FPELYGRKEHGYNENY EKLYGVYAEGSAX 

I I I I I I M II I I I II I I I I I I I I I I I I I I M I M II I I I I I II = 1 I I 1 I I I I I I I 1 

g 1 3 9 PAI EAGYTGRGVEVGI VDTGE SVGS I SFPELYGRKEHG YNENYKNKLQKL YGVYAEGS AX 

120 130 140 150 160 170 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 545>: 

al39 . seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACAACAGC GAAATCAGCA GCAATATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTACA AACCCAAATG AC G CAT . AC A AGAATTT GAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

4 01 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

4 51 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTAC. AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 546; ORF 139.a>: 

al39 .pep 

1 MRTTPTFPTK TFKPAAMALA VATTLSACLG GGGGGTSAPD FNAGGTG IGS 
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51 NSRATTAKSA AISYAGIKNE MCKDRSMLCA GRDDVAVTDR DAK I NAP PR I 
101 CIPETLQTQM THXKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 
151 LYGRKEHGYN ENYXKLYGVY AEGSA* 

ml39/al39 97.1% identity in 175 aa overlap 

10 20 30 40 50 60 

ml39 pep MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

I I M I I I 1 I I I I I I : I I I I t I II I I I I I I I I I I M M I I I It II I M It I I It I I I I I M 
al3 9 MRTTPTFPTKTFKPAAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 3 9 pep AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 

I : | I | I I I I ! I II I i I I I I I 1 I I I I M II I I I I I I I M I I I I I I I : I M I I I I I I I t I I 
al 39 AI SYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICI PETLQTQMTHXKNLINLK 

70 80 90 100 110 120 

130 140 150 160 170 

ml39 pep PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYEKLYGVYAEGSAX 
I I I M I I I I I M I I M I I I I I t I I I I I I I I I I M I I M I M I I I I I I I t 1 M I I I 
al 39 PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYXKLYGVYAEGSAX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 547>: 

gl40 . seq 

1 Atgtcggcac gCGGCAAGGG GGCAGgctat ctcAACAGTA CCGGACGACa 

51 TGTTCCCTTC CTGAGTGCCG C CAAAAT CGG GCAGGATTAT TCTTTCTTCA 

101 AAAATATCAA AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCCG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

3 01 GAGCTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

3 51 GGTCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCTACGG CGCACAACTT 

4 01 TCCGCACAGC GGCAGCCGTA CAGCATGCGA ATACCGCCGA CGGCGTACGc 
451 aTCTTcaaCA GTCTCGCCGC TAccgTCTAt GccgACAGTG CCGCCGCCCA 
501 TGccgATATG CAGGGACGCC GCCTGAAAGC CGT AT CGG AC GGGTTGGACC 
551 ACAACGGTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 
601 ACGTGGGAAC AGGGCGGTGT CGAAGG CAAA ATGCGCGGCA GTACCCAAAC 
651 TATCGG CATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 
701 TGGG CATAGG ACG C AGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 
751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGTGG GCGATATCGG 
801 CT AT CT CAAA GGCCTGTTCT CctaCGGACG CTACAAAAAC AGCATCAGCC 
851 GCAGCACCGG TGCGGATGAA TATGCGGAAG GCAGCGTCAA CGGCACGCTG 
901 ATGCAGCTGG GCGCACTGGG TGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
951 AGATTTGACG GTTGAAGGCG GTCTGCGCCA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCagt GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAACTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTCTG CGACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CGCGGTAACG GGCGGCTTTA CCGGCGCGGC TGCAGCAACC 

1201 GGCAAGACGG GTGCACGCAA TATGCCGCAC ACCCGCCGGG TTGCCGGTCT 

1251 GGGGGTGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG G CACGTT AC A 

13 01 GCTACACCGG TTCCAAACAG TACGGCAACC ACAG CGG AC A AATCGGCGTA 

13 51 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 548; ORF 140.ng>: 

gl40 .pep 

1 MSARGKGAGY LNSTGRHVPF LSAAKIGQDY SFFKNIKTDG GLLASLDSVE 

51 KTAGSEGDTP SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 

101 ELDASESSAT PETVETAVAD RTDMPGIRLR RTTFRTAAAV QHANTADGVR 

151 I FNS LAATVY ADSAAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 

201 TWEQGGVEGK MRGSTQTIGI AAKTGENTTA AATLGIGRST WSENSANAKT 
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2 51 DSISLFAGIR HDVGDIGYLK GLFSYGRYKN SISRSTGADE YAEGSVNGTL 

3 01 MQLGALGGVN VPFAATGDLT VEGGLRHDLL KQDAFAEKGS ALGWSGNSLT 

3 51 EGTLVGLAGL KLSQPLSDKA VLSATAGVER DLNGRDYAVT GGFTGAAAAT 

4 01 GKTGARNMPH TRRV AG LGVD VEFGNGWNGL ARYSYTGSKQ YGNHSGQIGV 
4 51 GYRF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 549>: 

ml40 . seq 

1 ATGTCGGCAC GCGGCAAGGG GGCAGGCTAT CTCAACAGTA CCGGACGACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGCCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

3 01 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

3 51 GGCAGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 

401 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 

451 ATCTTCAACA GTCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 

501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

551 ACAACGGCAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

701 TGGGCATGGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 

7 51 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 

801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 

851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 

901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 

951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACGC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

1201 GGCAAGACGG GGG C ACGCAA TATGCCGCAC ACCCGTCTGG TTGCCGGCCT 

1251 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

13 01 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA 

13 51 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 550; ORF 140>: 

ml40 .pep 

1 MSARGKGAGY LNSTGRRVPF LSAAKIGQDY SFFTNIETDG GLLASLDSVE 
51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 
101 ELDASESSAT PETVETAAAD RTDMPGIRPY GAT FRAAAAV QHANAADGVR 
151 IFNSLAATVY ADSTAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 
201 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGRST WSENSANAKT 

2 51 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 
301 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSLT 

3 51 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 

4 01 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 
451 GYRF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 140 shows 94.5% identity over a 454 aa overlap with a predicted ORF (ORF 140.ng) 
from N. gonorrhoeae: 

ml40/gl40 

10 20 30 40 50 60 

ml 4 0 . pep MS ARGKGAG YLNS TGRRVPFLSAAKIGQDYSFFTN I ETDGGLLASLDSVE KTAGSEGDTL 

I Ml MM M M M M M MM M Ml MMI I II M 1 1 1 1 1 1 1 M I M 1 1 II I 1 1 1 1 

9140 MSARGKGAGYLNSTGRHVPFLSAAKI GQDYS F FKN I KTDGGLLAS LD S VE KTAG SE GDTP 

10 20 30 40 50 60 

70 80 90 100 110 120 
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ml 4 0 . pep SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD 

M I lllll M M Ml MIMM Ml M I I I I MIIIMM M MM M I II MMMM MM! 

gl4 0 S YYVRRGNAARTAS AAAHS APAGLKHAVEQGGSNLENLMVELDAS ESS ATP ETVETAVAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 0 . pep RTDMPGI RP YGATFRAAAAVQHANAADGVRI FN S LAATV Y AD S T AAHADMQGRRL KAV S D 

I M 1 1 M I = 1 1 1 : M 1 1 II II 1 1 M 1 1 M 1 1 M M I M h 1 1 It II M II M I M I 

gl40 RTDMPGI RLRRTTFRTAAAVQHANTADGVRIFNSLAATVY ADS AAAHADMQGRRLKAVSD 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml4 0 . pep GLDHNGTGLRVIAQTQODGGTWEQGGVEGK3VIRGSTQTVGIAAKTGENTTAAATLGMGRST 

I I I i 111! !1 I Ml M Mil I ! I! Ml M I il MIII^IM I I I II I I II lllh 1 I , I 

gl40 GLDHNGTGLRVI AQTQQDGGTWEQGGVEGKMRGSTQTI GI AAKTGENTTAAATLGI GRST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml40 . pep WSENSANAKTDS I SLFAGIRHDAGDIGYLKGLFSYGRYKNS I SRSTGADEHAEGSVNGTL 

1 1 !MI MM !l M IMiM I MMI 1 1 1 1 IMIMIIMI IIMM MMMIIMM I 

gl40 WSENSANAKTDSI SLFAGIRHDVGDIGYLKGLFSYGRYKNS ISRSTGADEYAEGSVNGTL 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml4 0 . pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGIjAGL 

I II 1 1 1 1 1 1 1 1 1 M M I II M I M 1 1 1 1 II II 1 1 1 1 1 1 1 II M II 1 1 1 1 M II M M II 

gl40 MQLGALGGVWPFAATGDLTVEGGLRHDLLKQDAFAEKGSALGWSGNSLTEGTLVGLiAGL 
310 320 330 340 350 360 

370 380 390 400 410 420 

ml4 0 . pep KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 

MIIIIIMIII III MM Ml I M MM I! M 'MM MM M M M Mil II 1 1 h I 

gl4 0 KL S QP LSDKAVLS ATAGVERDLNGRD YAVTGG FTGAAAATGKTGARNMPHTRRVAGLGVD 

370 380 390 400 410 420 



430 440 450 

ml40 . pep VE FGNGWNGLAR Y S Y AGS KQ YGNHS GRVGVGYRFX 

1 1 M ! M M II 1 1 1 h M 1 1 1 1 1 1 1 1 • M 1 1 1 1 1 ( 

gl4 0 VEFGNGWNGLARYSYTGSKQYGNHSGQIGVGYRFX 
430 440 450 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 55 1>: 

al40 . seq 

1 ATGTCGGCAG GCGGTAAGGG GGCAGGCTAT CTCAACCGTA CCGGACAACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCGGGATTAT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GTAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

351 GGCCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 

4 01 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 

4 51 ATCTTCAACA ATCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 

501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

551 ACAACGCTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

7 01 TGGGCATGGG ACACAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 

751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 

801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 

851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 

901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
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951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCATCACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

1201 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGCCTGG TTGCCGGTCT 

1251 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

1301 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA 

1351 GGCTACCGGT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 552; OKF 140.a>: 

al40 .pep 

1 MSAGGKGAGY LNRTGQRVPF LSAAKIGRDY SFFTNIETDG GLLASLDSVE 

51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 

101 ELDASESSAT PETVETAAAD RTDMPGIRPY GATFRAAAAV QHANAADGVR 

151 IFNNLAATVY ADSTAAHADM QGRRLKAVSD GLDHNATGLR VIAQTQQDGG 

201 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGHST WSENSANAKT 

251 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 

301 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNS1T 

351 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 

4 01 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 

451 GYRF* 

ml40/al40 98.2% identity in 454 aa overlap 

10 20 30 40 50 60 

ml 4 0 . pep MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVE KTAGSEGDTL 

III I I I II M I I I : I I I I I II ! I I I : II I I I I M I I I I I I I I I I I I I M I I I I II I I I 
al40 MSAGGKGAGYLNRTGQRVPFLSAAKIGRDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml40 . pep SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD 
I I I I I I I I II I I If I I I I M I M I i I i M I I I I M I I I I I II I i II I ( I I I If II f I! I I 
al 4 0 SYYVRRGNAART AS AAAHSAPAGLKHAVEQGGSNLENLMVELDASES SAT PETVETAAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 40 . pep RTDMPGIRPYGATFRAAAAVQHANAADGVRI FNSLAATVYADSTAAHADMQGRRLKAVS D 

I I I I I I I M I I I I I I I I I M I I II I I I I I I I II : I M I I I I I M I I I M I I I II I II I II 
al40 RTDMPGIRPYGATFRAAAAVQHANAADGVRIFNNLAATVYADSTAAHADMQGRRLKAVSD 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 40 . pep GLDHNGTGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGRST 
I I I I I : I I I II I I I M i I M I t I I I I I I I I I I 1 I I I I I I I I II I I I I I I I I I II I I I : I I 
al 4 0 GLDHNATGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGHST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml40 . pep WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 
I II I I M I M I I I I II II I I M M I I I I I I I I 1 I I I I I M I I I I I I II I I I I I II I I I I I 
al4 0 WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 4 0 . pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 
II I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I M : I I I I I I I I I I I 
al40 MQLGALGGVN VPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNS I TEGTLVG LAG L 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 4 0 . pep KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
I I I I I I I I M I I I M I I I I 11 1 I M I I 1 I I I I I I I I I I I I I I I I I I I I I I I I II I I It I I 
al4 0 KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
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370 380 390 400 410 420 

430 440 450 

ml 4 0 .pep VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX 
II I I I I I I I I I I I I I I 11 I I I I I I I I I I I N I I I I 
al 4 0 VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX 

430 440 450 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 553>: 
gl41. 



. seq 




1 


atgagcttca 


51 


GATTGGCGAA 


101 


CTTACGGTCA 


151 


CCGCAAAAAC 


201 


GGCGGGCGAA 


251 


GCCATATCGG 


301 


CCGGTGTTCG 


351 


TTTGCCGATG 


401 


TCGGTGCGGC 


451 


CAAGGTAACG 


501 


GGTCGATATG 


551 


AGCCTGTtga 


601 


TCCGAAGTGa 


651 


AGAGCGTTtt 


701 


TTTACGCCAA 


751 


GATGCGATTA 


801 


TGTACACGGC 


851 


CCGCAACCCG 


901 


GGCTTCGGCG 


951 


CCTTGCCGGT 


1001 


CCCTGAAATA 


1051 


CTCGAAGCCT 


1101 


CCTGAAAAAC 


1151 


TGTCCGACTC 


1201 


CACGGCGTTG 


1251 


CGGCGCGGAT 


1301 


ATAACTTCGG 


1351 


CGTGCGATTG 


1401 


GGAAGCGTCT 


1451 


TGCCGATCTG 


1501 


CTCTTGGGCT 


1551 


TTCCGCCGGC 


1601 


TGCCGGGCCT 


1651 


CACGG CGTGA 



GATTTTGGTT ACCGCCATCA ACCCGACTCC 



CTCGCCGCCA TGCTCGACAA CCATATCTAC 
CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 



GCACACGGCG CGATGGCGGC ATTGCTAAAA 
GGTGC AAAC C ATCGAAGGCA CTCCGGCCTT 



TGCGGAAAAA TTCTGCGACA TCAAATGCCG 
ATGCGGCAGT CGTCGTGGCG ACTGTCCGCG 
GTGGAACGCG CCAACCTTGG TGAAGAAAAC 
TTTGCCCAAC CTGT TGAAAC ACAT TTCCAA 
TGCCCGTCGT CGTTGCG CTC AACCGCTTCG 
TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 



TTTCGCCTAC GATGT CGAGT TGGGCATCAA AG ACAAAAT C 



rGGCGAAA ACC CAATATT CATTGAGCGA CAACGCCAAA 
-CCGAAGG CTTCCGCATC GCCGTACGCG GTATCACTGT 
3GGCTTCA TCGTTGCGTT GTGCGGCAAT. ATGATGAAAA 
rGAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGAA 
2ACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 554; ORF 141.ng>: 

gl41 .pep 

1 MS FKTDAETA QSSTMRPIGE I AAKLGLNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTI GLA DALRHIGKDS VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNI ID GMGKPVDGVM RPD GFDITVA 

2 01 SEVMAVFCIA KDISDLKERF GNILVAYAKD GSPVYAKDLK AHGAMAALLK 
251 DAI KPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 
301 GFGADLGAEK FCDIKCRLAG LKPDAAWVA TVRALKYNGG VERANLGEEN 

3 51 LEALAKGLPN LLKHISNLKN VFGLPWVAL NRFVSDSDAE LAMIEKACAE 
401 HGVEVSLTEV WGKGGAGGAD IiARKWNAID NQPNNFGFAY DVELGIKDKI 
451 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LD KMP I CMAK TQYSLSDNAK 
501 LLGCPEGF RI AVRG I TVS AG AGFIVAL CGN MMKMPGLPKV PAAEKIDVDE 
551 HGVIHGLF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 555>: 
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ml41 . seq 

1 ATGAGCTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTTGGTCT GAATGCCGAC AACATTGAGC 

101 CTTACGGTCA TTACAAGGCG AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 

201 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCGTTGC 

251 GCCACATCGG CAAAGATGCC GTGATTGCCC TGCGCGAACC TTCTCTGGGG 

3 01 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 
351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 

4 01 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 
451 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 
501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGTA 
551 AACCCGTTGA CGGCGTGATG CGTCCTGACG GTTTCGATAT TACCGTTGCT 
601 TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAGACATCA GCGATTTGAA 
651 AGAGC GTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 
701 TTTACGCCAA AGATTTGAAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 
751 GATGCGATTA AG C CCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 
801 CGTAGACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 
851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 
901 GGCTT CGGCG CGGACTTGGG CGCGGAAAAA TTCTGCGACA TCAAATGCCG 
951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 

1001 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 

1051 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTC CAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACGC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGC CATTGAA AGTCAAACCA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGG CAT CAA AGACAAAATC 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 

1401 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TT GGACAAAA 

1451 TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAG CGA CAACGCCAAA 

1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 

1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 

1601 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 

1651 GAAGG CGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 556; ORF 141>: 

ml41 . pep 

1 MSFKTDAEIA QSSTMRPIGE IAAKLGLNAD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLIIiV TAINPTPAGE GKTTVTIGLA DALRH I GKD A VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDI NLHFTGD FHAI GAANNLi LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPD GFDITVA 

201 SEVMAVFCLA KDISDLKERL GNTLVAYAKD GSPVYAKDLK ANGAMAALLK 

251 DAI KPNLVQT IEGTPAFVHG G P FAN I AHGC NSVTATRLAK HLADYAVTEA 

301 GFGADLGAEK F CD I KCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

351 LDALE KGLPN LLKHISNLKN VFGLPWVAL NRFVSDADAE LAMIEKACAE 

401 HGVEVSLTEV WGKGGAGGAD LARKWNAIE SQTNNFGFAY DVELGIKDKI 

451 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEDFR I AVRG I TVS AG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDA 

551 EGVIHGLF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 141 shows 97.5% identity over a 558 aa overlap with a predicted ORF (ORF 141.ng) 
from N. gonorrhoeae: 

ml41/gl41 

10 20 30 40 50 60 

ml41 .pep MSFKTDAEIAQSS™RPIGEIAAKLGIiNADNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

MINIM I 1 1 1 1 1 1 1 1 1 1 1 1 1 : M 1 1 1 1 1 1 1 [ 1 1 1 1 ! 1 1 1 : 1 1 M ; 1 1 ! | M 

gl41 MSFKTDAETAQSSTMRPIGEIAAKLGLNVDNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

10 20 30 40 50 60 
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70 80 90 100 110 120 

ml41 . pep TAINPTPAGEGKTTVTIGLADALRHIGKDAVIALREPSLGPVFGVKGGAAGGGYAQVLPM 

I M I I II Ml II I! I M M Ml I I I II I I: ! 1 1 I I M I M I I Ml II II !! II I I M M 

9141 TAI NPTPAGEGKTTVT I GLADALRH I GKDS V I ALRE PS LG P VFGVKGGAAGGG YAQVLPM 



70 



80 



90 



100 



110 



120 



130 140 150 160 170 180 

ml4 1 . pep EDI NLHFTGDFHAI GAANNLLAAMLDNH I YQGNELNI DPKRVLWRRVVDMNDRQLRNI ID 

M MMMIMI MM I MM II IMMMMMIMM IMM MIMIIM I II I IM 

914 1 ED INLHFTGDFHAI GAANNLLAAMLDNH I YQGNELNI DPKRVLWRRVVDMNDRQLRNI ID 

130 140 150 160 170 180 

190 200 210 220 230 240 

m!41 .pep GMGKPVDGVMRPDGFDITVASEVmVFCLAKDISDLKERLGNILVAYAKDGSPVYAKDLK 

M Ml 1 1 Mill M I I M I I I II 1 1 M II I M M IM I Ml M I I I M II I M I M I M 

9l 41 GMGKPVDGVMRPDGFDI TVASEVMAVFCLAKDI SDLKERFGNI LVAYAKDGSPVYAKDLK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml41 . pep ANGAMAALLKDAI KPNLVQTIEGTPAFVHGGPFTVNIAHGCNSVTATRLAKHLiADYAVTEA 

M MM III I M II M I I M I II M MM I M M M I I I M M I M M M M II M M 

gl4 1 AHGAMAALLKDAI KPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 1 . pep GFGADLGAE KFCD I KCRLAGLKPDAAVVVATVRALKYNGGVERANLGE ENLDALEKGLPN 

MMMIMIIMM II IMIIIMIMMII IMIMI II M MMIIMI Mill 

914 1 GFGADLGAEKFCDIKCRLAGLKPDAAVVVATVRALKYNGGVERANLGEENLEAIiAKGIjPN 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml4 1 . pep LLKHI SNLKNVFGLPVWALNRFVSDADAE LAM I EKACAEHGVEVS LTEVWGKGGAGGAD 

MMIIIMIMMMI M IM MM IM M IIIMI : M Ml II Ml IMM I MM III 

9141 LLKH I SNLKNVFGLPVVVALNRFVSDSDAEIjAM I EKACAEHGVETVS LTEVWGKGGAGGAD 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml4 1 . pep LARPOAmAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 

MIMMIMM Mill IMMMMIMM IMMII Mill I lllllll II MM M 

g!41 LARKVVNAIDNQPNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml4 1 . pep LDKMPIO^KTQYSLSDNAKLLGCPEDFRIAVRGIWSAGAGFIVALCGNMMKMPGLPK^ 

IM IIIIMIIIIMM II IM MIM MM I II I M I M II I II II 1 1 II II II M II M 

gl41 LDKMPICMAKTQYSLSDNAKLI^CPEGFRIAVRGIWSAGAGFIVALCGNMMKMPGLPKV 

490 500 510 520 530 540 

550 559 
ml 4 1 . pep PAAEKI DVDAEGVI HGLFX 

MMIIMI MMIIMI 

gl4 1 PAAEKIDVDEHGVIHGLFX 

550 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 557>: 

a!41 . seq 

1 ATGAGTTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAC AACATTGAGC 

101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 
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201 GGCGGGCGAA GGTAAAACCA CCGTAACCAT CGGTTTGGCG GACGCATTGC 

251 GCCATATCGG CAAAGACTCT GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 

301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 

351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 

4 01 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 

4 51 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGCA 

551 AGCCTGTTGA CGGCGTGATG CGTCCTGACG GTTTCGATAT TACCGTTGCT 

601 TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAG AC AT C A GCGATTTGAA 

651 AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 

7 01 TTTACGCCAA AG ATT TG AAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 

751 GATGCGATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 

801 CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 

851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 

901 GGCTTCGGCG CGGACTTGGG CGCGGAAAAA • TTCTGCGACA TCAAATGCCG 

951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 

1001 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 

1051 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACTC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 

12 51 CGGCG CGG AT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AG AC AAAAT C 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 

14 01 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA 

14 51 TGCCGATCTG CAT G G CG AAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 

1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 

1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 

1601 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AG AAAAT CG A TGTGGACGCA 

1651 GAAGGCGTGA TTCACGGCTT GTTCTGA 



This corresponds to the amino acid sequence <SEQ ED 558; ORF 141. a>: 

al41 . pep 

1 MSFKTDAEIA QSSTMRPIGE IAAKLGLNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAIN PT PAGE GKTTVTIGLA DALRHIGKDS VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPDGFDITVA 

2 01 SEVMAVFCLA KDISDLKERL GNILVAYAKD GSPVYAKDLK AN G AMAALLK 

251 DAIKPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

301 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

351 LDALEKGLPN LLKHISNLKN VFGLPVWAL NRFVSDSDAE LAMIEKACAE 

4 01 HGVEVSLTEV WGKGGAGGAD LARKWNAIE SQTNNFGFAY DVELGIKDKI 

4 51 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEDFR I AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEK I DVDA 

551 EGVIHGLF* 

ml41/al41 99.5% identity in 558 aa overlap 

10 20 30 40 50 60 

ml 41 .pep MSFKT DAEIAQSSTMRPIGEIAAKLGLNADNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

M I I f 1 I I I I I I I I I I I I I II I 11 I I I I : I I I I I It I I I I I I I t I | I I | | I M I I M t I I 
al41 MSFKT DAE I AQSSTMRPIGE I AAKLGLNVDN I EPYGHYKAKINPAEAFKL PQKQGRLILV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml4 1 . pep TAINPTPAGEGKTTVTIGLADALRHIGKDAVIALREPSLGPVFGVKGGAAGGGYAQVLPM 
I I ! M M I II I I I I I ! | | | M | | | | | | M : I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
al41 TAINPTPAGEGKTTVTIGLADALRHIGKDSVIALREPSLGPVFGVKGGAAGGGYAQVLPM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml41 .pep EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRVVDMNDRQLRNIID 
N I I I I II I I I I II I I I I I I I I I I M I I I I I I I II I II I I I I M I I I I I I I I I I I I II I I 
al41 EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRWDMNDRQLRNIID 

130 140 150 160 170 180 
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190 200 210 220 230 240 

ml 41 .pep GMGKPVDGVMRPDGFDITVASEVMAVFCLAKDISDLKERLGNILVAYAKDGSPVYAKDLK 
I I I I ! I I I I I [ I I I I I I I M I II I I I 1 I I I I I i I I I I I I If I I t I I f I I I I I I I I I I I I I 
al 4 1 GMGKPVDGVMRPDGFDITVASEVMAVFCLAKDISDLKERLGNILVAYAKDGSPVYAKDLK 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 4 1 . pep ANGAMAALLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 
I I I I It I I I I I I I I I I I I M I I I M M I I I I I I I I I I I I I I I I I I I I I I I M I i I I I I I I 
al41 ANGAMAALLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 1 . pep GFGADLGAEKFCDIKCRIiAGLKPDAAVWATVRALKYNGGVERANLGEENLDALEKGLPN 
I I I I I I II I I I I I I M I I I I I I I I I i I I I I I I I I i I I I I M I M M I I M II I I I M I I I 
al 4 1 GFGADLGAEKFCDIKCRLAGLKPDAAVVVATVRALKYNGGVERANLGEENLDALEKGLPN 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 4 1 . pep LLKHISNLKNVFGLPVWALNRFVSDADAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 
M I I I I I I I M M I II I I I M I I I I I : I II I I I I I I I I I I I I I I I I I II I II II I II I II 
al41 LLKHISNLKNVFGLPVWALNRFVSDSDAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 

370 380 390 400 410 420 



430 440 450 460 470 480 

ml 41 .pep LARKWNAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 
II I I I M I I I I M II I I I I I II I I f I I I I I I I I I I I I I M I I j I I I I I I I I I I II I IE I I 
al 4 1 LARKWNAIESQTNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 41 . pep LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 
I M I I M M I I I I I I! I If I I I I I I I I I I I M I I II I 1 I I I M I i I I I I I I I I M I M M 
al 4 1 LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPICV 

490 500 510 520 530 540 



550 559 
ml 41 .pep PAAEKIDVDAEGVIHGLFX 
M I I I I I I I I I I I I I I I I I 
al 4 1 PAAEKIDVDAEGVIHGLFX 

550 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 559>: 

gl42 . seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCTTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAAATATGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCAT ATTGATGCAG AGg CTGCCGT 

201 TTTCCGACAG GATcggaATG AttcgCGCAC TCCGGTTTAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCGGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

4 01 GCCATTTTTC CCCTTTAAAC CGTCCCCTAT ATAAGAATGC TGCACACAAG 

451 GCATCCCCCC ATGTGCAGCA GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 560; ORF 142.ng>: 

g!42 .pep 

1 MRADFMFADN MPVQVRQRAF YFKLSRFAAM PNMVGKPLFG RQAGQPGKMF 

51 GNILMFVRQH IDAEAAVFRQ DRNDSRTPVY AQHHGRRLVG NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN RPLYKNAAHK 

151 ASPHVQQF* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 561 



>: 



ml42 . seq 








1 


ATGCGTGCCG 


ATTTCATGTT 


TGCCGACAAT 


51 


ACGCGCCCTC 


TATTT CAAGT 


TGTCCCGTTT 


101 


TAGGCAAACC 


GCTCTTCGGG 


CGACAGGCCG 


151 


GGCAACATCC 


TGATGTTCGT 


CCGCCAGCGT 


201 


TTT CCGACAG 


GAT CGGAATG 


ATTCGCGCAC 


251 


ACGGTCGGCG 


GCTCGTCGGT 


AACCGGCGCG 


301 


GTAACGCCCT 


GCCGCACCGT 


CTGTCGTGAT 


351 


AAGATGCCAT 


CGCATCACGG 


AACGAAGTTT 


401 


GCCATTTTTC 


C CCTTTAAAC 


TGTCCCCTAT 


451 


GCATCCCCcC 


ATGTGCAGCA 


GTTTTGA 



ATGCCCGTGC AGGTGCGCCA 
TGCCGCGATG CCAGATGTGG 
GTCAGCCCGG CAAAATGTTC 
ATTGATGCAG AGGCTGCCGT 
TCCGGTTGAT GCACAGCATC 
ACCGCCGTCA TTGTAATGCC 
GACATGAACG CCTGCCGCGC 
GAAAATTTTT CTGCAAATCC 
ATAAGAATGC TGCACACAAG 

This corresponds to the amino acid sequence <SEQ ID 562; ORF 142>: 

ml42.pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDWGKPLFG RQAGQPGKMF 
51 GNILMFVRQR IDAEAAVFRQ DRNDSRTPVD AQHHGRRLVG NRRDRRHCNA 
101 VTPCRTVCRD DMNACRARCH RITERSLKIF LQIRHFSPLN CPLYKNAAHK 
151 ASPHVQQF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 142 shows 93.7% identity over a 158 aa overlap with a predicted ORF (ORF 142.ng) 
from N. gonorrhoeae: 

ml42/gl42 



10 20 30 40 50 60 

ml42 .pep MRADFMFADNMPVQVRQRALYFKLSRFAAMPDVVGKPLFGRQAGQPGKMFGNILMFVRQR 

MM I IMIMI I I II M Mill IMMM MM II II IMM! I I M MMM MUM 

gl42 MRADFMFADNMPVQVRQRAFYFKLSRFAAMPNMVGKPLFGRQAGQPGKMFGNILMFVRQH 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 4 2 . pep I DAEAAVFRQDRNDSRTPVDAQHHGRRLVGNRRDRRHCNAVT PCRTVCRDDMNACRARCH 

Mill MM II II IIMM: III II MMII MMI IMIM II I I M Ml MMM 1 1 

gl42 I DAE AAVFRQDRNDS RTPVYAQHHGRRLVGNRRNRRHCNAVTP CRTVCRDDMNACRTGCH 

70 80 90 100 110 120 



130 140 150 159 

ml42 . pep RITERSLKI FLQIRHFSPLNCPLYKNAAHKASPHVQQFX 

Illlllll 1 I I I I t I t I i 1 Ml I I I II I MM I MM 
g 1 4 2 RI TERSLKS FLQ I RHFS PLNRPLYKNAAHKAS PHVQQFX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 563>: 

al42 . seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTT CAAGT TGTCCCGTTT TGCCGCGATG CCAGATGTGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTT CCGACAG GATCGGAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCCGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

401 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG 

4 51 GCACCCCCCA TGTGCAGCAG TTCTGATTCA AAAAGCCGTC GGTCGGACAT 

501 TTCCGCGCGT TACGGCGTAT TACGAGTTCA ACGCATCCTC GATTTTGGCA 

551 AGTTCTGCCA ACAGGTCTTT AAGCAGCAGC ATTTTCTCGC GGCCCAGCAC 

601 TTCCTCGATA GCGTCGTAAC GCTCGTCCAC TTCTTCGCCG ATTTCCTCAT 

651 ACAGCTTCTC GCCCTCGGCA GTCAGCTTCA GAAAAACACG TCGTTGGTCG 

701 TTGGAAGGTT TCAGGCGGAC AACCAAACCC GCTTTTTCAA GGCGGGTCAG 

751 GATACCGGTC AGGCTGGGGC GCAAAATGCA CGCCTGATTC GCCAAATCTT 
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801 GAAAGTCCAG CGTGCCGTTT TCCGCCAAAA GACGGATAAT CCGCCATTGC 
851 TGATCGGTAA TATTCGCCTG ATTCAGAATA GGCCTGAATT GGGTCATCAG 
901 GGCTTCCCTT GCCTGTATCA GACCGATATT GATAGACGCA TGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 564; ORF 142.a>: 

al42 .pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDVVGKPLFG RQAGQPGKMF 

51 GNILMFVRQR I DAEAAVFRQ DRNDSRTPVD AQHHGRRLVR NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN CPLYKNAAHK 

151 APPMCSSSDS KSRRSDISAR YGVLRVQRIL DFGKFCQQVF KQQHFLAAQH 

201 FLDS WTLVH FFADFLIQLL A LGSQLQKNT SLWGRFQAD NQTRFFKAGQ 

251 DTGQAGAQNA RLIRQILKVQ RAVFRQKTDN PPLLIGNIRL IQNRPELGHQ 

301 GFPCLYQTDI DRRMF* 

m!42/al42 96. 1% identity in 153 aa overlap 

10 20 30 40 50 60 

ml42 . pep MRADFMFADNMPVQVRQRALYFKLSRFAAMPDVVGKPLFGRQAGQPGKMFGNILMFVRQR 

! I I I I I I I I I M I I 1 I I I i I I I I I I I II I I I I I I I I I I 1 I I I I I I I I I I I I I I i I I I I I i 

al42 MRADFMFADNMPVQVRQRALYFKLSRFAAMPDWGKPLFGRQAGQPGKMFGNILMFVRQR 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml42 . pep IDAEAAVFRQDRNDSRTPVDAQHHGRRLVGNRRDRRHCNAVTPCRTVCRDDMNACRARCH 
I i I I II I I I I I I I M I I M I I I I I I I I I I I I I : I II I I I I 11 I I I I I I M I I I I I : II 
al42 IDAEAAVFRQDRNDSRTPVDAQHHGRRLVRNRRNRRHCNAVTPCRTVCRDDMNACRTGCH 

70 80 90 100 110 120 

130 140 150 159 

ml42 .pep RITERSLKIFLQIRHFSPLNCPLYKNAAHKASPHVQQFX 

I II I I I I I I I I M II t I I I I I I II I I I I I! I 
al42 RITERSLKSFLQIRHFSPLNCPLYKNAAHKAPPMCSSSDSKSRRSDISAR YGVLRVQRIL 

130 140 150 160 170 180 

al42 DFGKFCQQVFKQQHFLAAQHFLDSWTLVHFFADFLIQLLALGSQLQKNT SLWGRFQAD 

190 200 210 220 230 240 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 565>: 

gl43 . seq 

1 ATGTTGAGCT TCGGCTATCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAGATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTTCAGCCG 

151 ATAGTGgGCT ACTACTCAGA CCGCACTTGG AAGCCGCGCT TGGGCGGCCG 

201 CCGCCTGCCG TATCTGCTTT ACGGCACGCT GATTGCGGTC ATCGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCCTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTGGACG TGTCGTCGAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGATATG GTCAACGAGG 

4 01 AGCAGAAAAG CTACGCCTAC GGGATTCAAA GTTTCTTAGC GAATACGGAC 

451 GCGGTTGTGG CAGCGATTCT GCCGTTTGTG TTcgcgtata TCGGTTTGGC 

501 GAACACTGCC GAGAAAGGCG TTGTGCCACA AACCGTGGTC GTAGCATTCT 

551 ATGTGGGTGC GGCGTTACTG ATTATTACCA GTGCGTTCAC AATCTCCAAA 

601 GTCAAAGAAT ACGACCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGTT CGAACTCTTA AAAACCGCGC 

7 01 CTAAAGTGTT TTGGACGGTT ACTCCGGTAC AGTTTTTCTG CTGGTTCGCC 

751 TTCCGGTATA TGTGGACTTA CTCGGCAGGC GCGATTGCAG AAAACGTCTG 

801 GCACACTACC GATGCGTCTT CCGTAGGCCA TCAGGAGGCG GGCAACCGGT 

851 ACGGCGTTTT GGCGGCGGTG TAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 ATTCTGGCAA AAGTACCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GTTTCTTCTC TATCTTCTTC ATCTACAATC 

1001 AATACGCACT CATCCTGTCT TATATCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGGCC AACGCTTTGT CGGGCAAACA 

1101 CAT GG AT ACT TATTTGGGCC TGTttaacgg ctctgtCTGT ATGCcgcaaa 

1151 tcgTcgctTC GctgttgAGT TTCGTGCTTT TCCCGATGCT GGGCGGCCAT 
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1201 CAGGCAACCA TGTTCTTGGT TGCAGGCGCA GTCTTGCTGC TGGGAGCCTT 

1251 CTCAGTCTGT CTGAT TAAAG AGATCCACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 566; ORF 143.ng>: 

gl43 . pep 

1 MLSFGYLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVG YYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKSYAY GIQSFLANTD 

151 AWAAILPFV FAYIGLA NTA EKGVVPQT VV VAFYVGAALL IITSA FTISK 

201 VKEYDPETYA RYHGIDVAAN QEKANWFELL KTAPKVFWTV TPVQFFCWFA 

2 51 FRYMWTYSAG AIAENVWHTT DASSVGHQEA GNRYG VLAAV *SVAAVICSF 

301 ILAKVPNKYH KAG YFGCLAL GALGFFSIFF IYNQ YALILS YILIGIAWAG 

351 IITYPLTIVA NALSGKHMDT YLGLFN GSVC MPQIVASLLS FVL FPMLGGH 

4 01 QATM FLVAGA VLLLGAFSVC L IKEIHGGV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 567>: 

ml43 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAAATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

2 51 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

4 01 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC AAATACGGGC 

4 51 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACGATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

7 01 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 

7 51 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG 

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1001 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA 

1101 TATGGGCACT TACTTGGGCT TGTTTAACGG CTCTATCTGT ATGCCTCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 568; ORF 143>: 

ml 4 3 . pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVGHYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLA NTA EKGVVPQT VV VAFYVGAALL VITSA FTIFK 

201 VKEYDPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKVPNKYH KAG YFGCLAL GALGFFSVFF IGNQ YALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFN GSIC MPQIVASLLS FVL FPMLGGL 

401 QATM FLVGGV VLLLGAFSVF LIKETHGGV* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from TV. gonorrhoeae 

ml43 / gl43 93.9% identity in 42 9 aa overlap 

10 20 30 40 50 60 
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ml 4 3 .pep MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 
I I I I I : M I I M M M I I I I ! I I I I I I I M I M 1 M I 1 I I I I M i I 1 I f I I M : I I I I II 
g 14 3 MLSFGYLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGYYSDRTW 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 3 .pep KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 

I I I I II I I I I I I I I M I I I I I I I I I I I I I II 1 I I I I I I M I I I I I I I I I I I I I I I M I I I 
g 14 3 KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 3 pep QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAWAAILPFVFAYIGLANTAEKGWPQTVV 

I I I I I I It I M I M I I : I Mill I I I I I I I I I I I I I t I I I I I I I I I I I I I I M 

g 14 3 QPFKMMVGDMVNEEQKSYAYGIQSFLANTDAWAAILPFVFAYIGLANTAEKGWPQTW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml43 .pep VAFYVGAALLVITSAFTIFKVKEYDPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 
E I I I I I I I I I : I I II I I I I I I I M I I I I I II I I I I I IE I I I I I I I : I I I I I I I I : I I I I 
g l 4 3 VAFYVGAALLIITSAFTISKVKEYDPETYARYHGIDVAANQEKANWFELLKTAPKVFWTV 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 4 3 . pep TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 
I I I I I I I I I I : M I I I I I I I I t I I It I I I I I I I I I : I M I I I I I I I I I I I I I I I I I I 
g 143 TPVQFFCWFAFRYMWTYSAGAIAENVWHTTDASSVGHQEAGNRYGVLAAVXSVAAVICSF 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 3. pep V L AKV PN K Y HKAG Y FG C L ALG ALG FF S V FF I GN Q Y AL VL S YT L I G I AW AG IITYPLTIVT 

: I I I I I I I I I II I I I I I I I I I I I I I M : I I I 11111:111 I I I I II I I M I I I M M : 
gl43 ILAKVPNKYHKAGYFGCLALGALGFFSIFFIYNQYALILSYILIGIAWAGIITYPLTIVA 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 4 3 . pep NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGWLLLGAFSVF 

MINIM I II I I II I I : I I I I II I I I I I II II II I I I I I I I I M : I : II I I I I I M 
g l 4 3 NALSGKHMDTYLGLFNGSVCMPQIVASLLSFVLFPMLGGHQATMFLVAGAVLLLGAFSVC 

370 380 390 400 410 420 

430 

ml 4 3 . pep LIKETHGGVX 
MM Mill 
gl4 3 LIKEIHGGVX 

430 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 569>: 

al43 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAGATG AGCCGCATCT TCCAGACGCT CGGTGCCGAT CCGCACAGCC 

101 TCGGCTGGTT CTTTATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

4 01 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC GAATACGGGC 

451 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACAATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

701 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 
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7 51 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG 

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1001 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA 

1101 TATGGGCACT TACTTGGGCC TGTTTAACGG CTCTATCTGT ATGCCGCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 



This corresponds to the amino acid sequence <SEQ ID 570; ORF 143.a>: 

al43 .pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHSLGW FFIL PPLAGMLVQP 

51 IVG HYSDRTW KPRLGGR RLP YLLYGTLIAV IVMIL MPNSG SFGFGY ASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLAN TA EKGWPQT W VAFYVGAALL VITSA FTIFK 

201 VKEYNPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VIiAAV QSVAAVICSF 

301 VLAKV PNKYH KAGY FGCLAL GALGFFSVFF I GNQY ALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFNGSIC MPQ IVASLLS FVLFPMLGGL 

401 OATM FLVGGV VLLLGAJSVF LI KETHGGV* 

ml43/al43 99.5% identity in 429 aa overlap 

10 20 30 40 50 60 

MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 

M I I I I I I | | | | I I 1 I I I It II M : I I M 11 II I It I M I I I I i I It I I I II 

MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHSLGWFFILPPLAGMLVQPIVGHYSDRTW 
10 20 30 40 50 60 



ml 4 3 .pep 
al43 



70 80 90 100 110 120 

ml43 pep KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASI^ALSFGALMIALLDVSSNMAM 
M | I I 1 i I M M I I I I I I M I I I I I I I M I I I E I 1 I M I I I I I I I I I II I! I M I I I I I t 
al43 KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 3 pep QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAVVAAILPFVFAYIGLANTAEKGVVPQTVV 

M I I I ! I I I I I I I I I I I I I I I I M I I II I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I 
a 143 QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAWAAILPFVFAYIGLANTAEKGWPQTW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 3 pep VAFYVGAALLVITSAFTIFKVKEYDPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 
| M M I II I I 1 I I I M I I I I I I I I : M I I I I I I I I I I I I I I I II I I I II I I! II I I I I I I 
a 1 4 3 VAFYVG AALLV I T S AFT I FKVKE YN PET Y ARYHG I DVAANQEKANW I ELLKTAPKAFWT V 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml43 pep TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 
I I I I I I I I I I I I I I I I I M I ! I I I I M I I I It M I I I I II I I I I I I I I I M I I I M I I I I 
al 4 3 TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 3. pep VLAICVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 
| I M I I I I I I I I 1 M I I I I I 1 I I M II I 1 I I I I 1 I I I M I I I II I I I I I I I I I I I I I M I 
a 143 VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 

310 320 330 340 350 360 



ml4 3 .pep 



370 380 390 400 410 420 

NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGWLLLGAFSVF 
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I I I I I I II I I I II I I I M I I I 1 I I t I I I I I I I I it 1 I I I l I I I M i I 1! I I II I I II I 1 I 
al4 3 NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGVVLLLGAFSVF 

370 380 390 400 410 420 

430 

ml 4 3 . pep LIKETHGGVX 
I I I I I I I I I I 
al4 3 LIKETHGGVX 

430 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 571>: 

gl44 . seq 

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGGGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGTGC GTCTTCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC CCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGTAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCcgtTT 

351 CAACGCGGTG GCGGCAGACG GccgacggTt atCCCAACGA TTTGGatatT 

4 01 TCctaccgCT TGGACGAGGA CGGCCGGCTT ACCGT taccT ATCGCGCCAC 

4 51 CGCgctCGGC GACACGGTGT TCGACCCGAC GCTGCACATT TACTGGCGGC 

501 TGGACGCGGG CCTGCACGAT GCGGTTCTGC ATATTCCGCA GGGCGGACAT 

551 ATTCCGGCCG ATGCCGAAAA ACTGCCCGTC TTAACGGTTT CAGACGGCCT 

601 CGAAGTATTT GA 

This corresponds to the amino acid sequence <SEQ ID 572; ORF 144.ng>: 

gl4 4 , pep 

1 MSDTPATRDF GLIDGRAVTG YVLSNRRGT C VFVLDLGGIV QEFSVL ADGV 

51 RENPWSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRRLSQR FGYFLPLGRG RPAYRYLSRH 

151 RARRHGVRPD AAHLLAAGRG PARCGSAYSA GRTYSGRCRK TARLNGFRRP 

201 RSI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 573>: 

ml4 4 . seq 

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGTCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGATGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAAC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTGg 

4 01 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

4 51 CCGCTTGGAC GAGGACGACC GGCTTACCGT TAcCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATGCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 

This corresponds to the amino acid sequence <SEQ ID 574; ORF 144>: 

ml 4 4 . pep 

1 MSDTPATRDF GLIDGRAVTG YVLSNRRGT R VCVLDLGGIV QEFSVL ADGV 

51 RENLWSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLATVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYA 

2 01 GRCRKTARLN GFRRPRSI* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from TV. gonorrhoeae 

m!44 / gl44 91.3% identity in 218 aa overlap 

10 20 30 40 50 60 

ml 4 4 . pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGVRENLWSFDD 
I M I I I 1 I 1 I I 1 I I I II I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I 1 I I I 
gl4 4 MSDTPATRDFGLIDGRAVTGYVLSNRRGTCVFVLDLGGIVQEFSVLADGVRENPWSFDD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 4 . pep AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
I I I I E I M I I M I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 11 I I ! I I 
gl4 4 AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 4 .pep AADGRSWLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 
Ml I I I I I I I U I M I I I I I I II I I I I M I I M I I I I M I I I I M 

gl4 4 AAD GRRLSQRFG — YFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 

190 200 210 219 

ml 4 4 . pep AAGRGPARCGS AYSAGRTYAGRCRKTARLNGFRRPRS IX 

I I 11 I I I I I I I I I I I ! I I J : I I I I M M 1 I I I I II I I I I 
gl 4 4 AAGRGPARCGS AY SAGRTYSGRCRKTARLNGFRRPRS IX 

170 180 190 200 



The following partial DNA sequence was 

al44 . seq 

1 ATGAGCGATA CCCCCGCTAC 

51 CGTAACCGGC TATGTGCTGT 

101 TGGACTTGGG CGGGATTGTG 

151 CGCGAAAACC TCGTGGTGTC 

201 TCCGTTTCAG ATTAACAAGC 

251 GTGCGGCGTT C G AC AT C AAC 

301 GGCAGGAACG CGCTGCACGG 

351 CAACGCGGTG GCGGCAGACG 

4 01 CAACAGTCGG CCGACGGTTA 

451 CCGCTTGGAC GAGGACGACC 

501 TCGGCGACAC GGTGTTCGAC 

551 GCGGGCCTGC ACGATGCGGT 

601 GGCCGATGCC GAAAAACTGC 

651 TATTTGA 



identified in N. meningitidis <SEQ ID 575^ 



CCGCGATTTC 
CCAACCGGCG 
CAGGAATTTT 
G T T CG AC G AT 
AGATAGGGCG 
GGCAGGACTT 
CGGTTCGCAC 
GCCGTTCGGT 
TCCCAACGAT 
GGCTTACCGT 
CCGACGCTGC 
TCTGCATATT 
CCGTCTCAAC 



GGCCTGATCG 
TGGTACGCGT 
CCGTTTTGGC 
GCGGCTTCCT 
CGTGGCCGGA 
ACCGCGTGGA 
GGGCTGGCCG 
GGTGCTGCGC 
TTGGATTTGG 
TACCTATCGC 
ACATTTACTG 
CCGCAGGGCG 
GGTTTCAGAC 



ACGGGCGTGC 
GTCTGCGTGC 
AGACGGCGTG 
ATGCGGACAA 
CGCATCCGCG 
GGCCAACGAA 
TTACCCGTTT 
AGCCGCCTG . 
ATATTTCCTA 
GCCACCGCGC 
GCGGCTGGAC 
GACATATTCC 
GACCTCGAAG 



This corresponds to the amino acid sequence <SEQ ID 576; ORF 144.a>: 



al44 . 



pep 
1 
51 
101 
151 
201 



MSDTPATRDF GLIDGRAVTG YVLSNRRGTR VCVLDLGGIV QEFSVLA DGV 

RENLVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLXTVGRRL SQRFGFGYFL 

PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYS 
GRCRKTARLN GFRRPRSI* 



m!44/al44 99.1% identity in 218 aa overlap 

10 20 30 40 50 60 

ml44 . pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGVRENLWSFDD 
I I I I I I M I I I I I I I M I I I I I 1 I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I 
a 1 4 4 MS DTPATRDFGLI DGRAVTGYVLSNRRGTRVCVLDLGG I VQE FS VLADGVRENLWS FDD 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 4 4 .pep AASYADNPFQ I NKQIGRVAGR I RGAAFD I NGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
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I l | | I | l | | l I I I II I I I M I I I I I ! M I I I I I I I I M I I I I I I I t I ! M M M I I I I 1 I 
AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 

70 80 90 100 110 120 

130 140 150 160 170 180 

AADGRSWLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

| | ) I I I I I I I I I 1 I I I I II M I I I I I 11 I I I I I I I I I 1 I I I I 1 I I I I I I I I I I I M I I I 
AADGRSWLRSRLXTVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 
130 140 150 160 170 180 

190 200 210 219 

AAGRGPARCGSAYSAGRTYAGRCRKTARLNGFRRPRSIX 
I I I I II I I I I I I II I i I M : I I I I I M I I I I I I I I M I I 
AAGRGPARCGSAYSAGRTYSGRCRKTARLNGFRRPRSIX 
190 200 210 

The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 577>: 

gl46 . seq 

1 ATGAAGCAAA TCCCCCTCCG CCTTCTCCAG GTCGTCATTG ACCACGACAA 

51 AGTCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAa ctTCCCGACT GTCCGTCCCG CGCcctTTGA GGCGCGCGGC 

151 AAGCACGTCG AAAGAAGGCG GCAGGATAAA GATACCGACA GCTTCCGGCA 

201 GCGCGTTGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACG CCCTCCGTGC TTGTGCCGTA 

301 ATAGTTGCCG AATACGTCTG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGATT CAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

4 01 TCGCCTTCAC GCGGCGGGCG CGTCGTATGC GACACGGAAA CGCGCAAACC 

4 51 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAG CGGCCG A AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAA CCGATTGTAT CACAACGGAC ACCCTATTTC 

601 ATATTTGCCG AT G C C CAT AT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 578; ORF 146.ng>: 

gl4 6 . pep 

1 MKQIPLRLLQ WIDHDKVEQ YGLFDFMPCL RQPPLDNFPT VRPAPFEARG 

51 KHVERRRQDK DTDSFRQRVA NLRRALNVDF QNHVIACRRQ R IHALRACAV 

101 IVAEYVCV FQ KSLLRDKRFK LFFGNKVIMY AVCFAFTRRA RRMRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPYF 

201 IFADAHILPL LF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 579>: 

ml46 . seq 

1 ATGGCGCAAA TCCTCCTCCG CTCGCGCCAA GTCGTCATTG ACCACGACAA 

51 AGTCAAACAA TACGGACTGC TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GGCGCGCGGC 

151 AAGTACGTCG AAAGAAGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCGTCGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC CTGTGCCGTA 

301 ATAGTTGCCA AATACGTCGG CGTATTCCAA AAAAGCTTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

4 01 TCGCCTTCAC GCGGCGGGCG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAATGGAC ACCCAGTTTC 

601 CTATTTGCCG ATG C C CAT AT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 580; ORF 146>: 

ml 4 6 . pep 

1 MAQILLRSRQ WIDHDKVKQ YGLLDFMPCL RQPPLDNFPT VRPASVEARG 

51 KYVERRRQDK DADGFGQRVA NLRRALNVDF QNHVIACRRQ R IHTLRACAV 

101 I VAKYVGV FQ KSFLRDKRLK LFFGNKVIMY AVCFAFTRRA RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQWTPSF 

201 LFADAHILPL LF* 



ml 4 4 .pep 
al44 

■» 

ml 4 4 . pep 
al44 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml46 / gl46 90.1% identity in 212 aa overlap 

10 20 30 40 50 60 

ml 4 6 . pep MAQILLRSRQVVIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 
I 1 I II I I I I i II 1 I : 1 I I I : I I I I I I I I II II I II I I 11 I I I I I 1 : I M I M 1 I 
gl4 6 MKQIPLRLLQWIDHDKVEQYGLFDFMPCLRQPPLDNFPTVRPAPFEARGKHVERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 6 . pep DADGFGQRVANLRRALNVDFQNHVI ACRRQRIHTLRACAVI VAKYVGVFQKSFLRDKRLK 

t : I : I I 1 I I I I I I I I II I II I I I I 11 I I I II I : I I I II I I M : I t II I I I : I I I I I : I 
gl 4 6 DTDSFRQRVANLRRALNVDFQNHVI ACRRQRIHALRACAVI VAEYVCVFQKSLLRDKRFK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml4 6 . pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 
I I I I 1 M M I I M I I I M I I I I : I I I II I I I I I I I I I I I I 1 I I I M I I II I I I I I I I I II 
gl4 6 LFFGNKVIMYAVCFAFTRRARRMRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

130 140 150 160 170 180 

190 200 210 

ml4 6 . pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 
I I I I I I I I I I I I I I I II I : I I 1 II I I I I I I I 
gl4 6 GHIFYLYIFQPIVSQRTPYFIFADAHILPLLFX 

190 200 210 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 58 1>: 

al46. seq 

1 ATGGCGCAAA TCCTCCTCCG CCCGCGCCAA GTCATCATTG ACCACGACAA 

51 AATCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GACGCGCAGC 

151 AAGCACATCG AAAGACGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCATCTCG AACCTGAGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAACCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC TTGTGCCGTA 

301 ATAGTTGCCG AACACGTCCG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGACG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

451 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAACGGAC ACCCGGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 582; ORF 146.a>: 

al4 6. pep 

1 MAQ1LLRPRQ VIIDHDKIEQ YGLFDFMPCL RQPPLDNFPT VRPASVETRS 

51 KHIERRRQDK DADGFGQRIS NLSRALNVDF QNHVITCRRQ RIHTLRACAV 

101 IVAEHVRVFQ KSLLRDKRLK LFFGNKVIMY AVCFAFTRRT RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPGF 

201 LFADAHILPL LF* 

ml46/al46 90.6% identity in 212 aa overlap 

10 20 30 40 50 60 

ml4 6 .pep MAQILLRSRQVVIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 
I I I I I I I I I I : I II I I :: I I I I : I I I I M I I I I I I I I I I I I I II I I : I : I :: I II II I I 
al4 6 MAQILLRPRQVIIDHDKIEQYGLFDFMPCLRQPPLDNFPTVRPASVETRSKHIERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 
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ml 4 6 . pep DADGFGQRVANLRRALNVDFQNHVIACRRQRIHTLRACAVIVAKYVGVFQKSFLRDKRLK 
I II II I I I I I I I I I I I I M I I I : I I I I I I I I I I I 1 I I 1 I I :: I I I I II : I I II I M 
al4 6 D ADG FGQR I SN LS RALN VD FQN H V IT CRRQR IHT LRAC AV I VAE H VRVFQKS LLRDKRLK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 6 . pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 
I I I I I I II M II I I J M I I : II I I I II I I I I ! I I ! I I I i I I I I I I ! 1 I I I I I I I I I I I ! I 
al4 6 LFFGNKVIMYAVCFAFTRRTRRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

130 140 150 160 . 170 180 

190 200 210 

ml4 6 . pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 
I I I I I I I II M I I M M : t i M I I I I M M I I 
al4 6 GHIFYLYIFQPIVSQRTPGFLFADAHILPLLFX 

190 200 210 

The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 583>: 

gl47.seq (partial) 

1 . . ATGCGACGAG AAGCCAAAAT GGCACAAATC ACACTCAAAC CCATTGTTTT 

51 ATCAATTCTT TTAATCAACA CACCCCTCCT CGCCCAAGCG CATGAAACTG 

101 AGCAATCGGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCGACTT CGGGGCTGCT GCACACTTCG ACCGCCTCCG ACAAAATCAT 

201 CTCCGGCGAT ACTTTGCGCC AAAAAGCCGT CAACTTGGGC GACGCTTTGG 

251 ACGGCGTACC GGGCATCCAC GCTTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC GGGCAGACGG ATTAAAGTAT TGAACCATCA 

351 CGGCGAAACG GGCGATATGG CGGACTTTTC TCCCGATCAC GCCATTATGG 

4 01 TAGATACCGC CTTGTCGCAA CAGGTTGAAA TCCTGCGCGG GCCGGTTACG 

4 51 CTCTTGTACA GCTCGGgcaa tgtggccgGG GCTGGtcaat gttgccgatg 

501 gAAAAAtccc ccaaaaAAtg cc . . 

This corresponds to the amino acid sequence <SEQ ID 584; ORF 147.ng>: 

gl47.pep (partial) 

1 . .MRREAKMAQI TLKPIVLSIL LINTPLLAQA HETEQSVGLE TVSWGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDTALSQ QVEILRGPVT 

151 LLYSSGNVAG AGQCCRWKNP PKNA. . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 585>: 

ml47*seq (partial) 

1 . . CCGCATAAAA CTGAGCAATC GGTGGATTTG GAAACGGTCA GCGTCGTCGG 

51 CAAAAGCCGT CCGCGCGCCA CGTCGGGGCT GTTGCACACT TCGACCGCCT 

101 CCGACAAAAT CATCTCCGGC GATACCTTGC GCCAAAAAGC CGTCAACTTG 

151 GGCGACGCTT TAGACGGCGT ACCGGGCATC CACGCTTCGC AATACGGCGG 

201 CGGCGCGTCT GCTCCCGTCA TTCGCGGTCA AACAGGCAGG CGGATTAAAG 

251 TGTTGAACCA TCACGGCGAA ACAGGCGATA TGGCGGATTT TTCGCCCGAT 

301 CACGCCATTA TGGTAGATAC CGCCTTGTCG CAACAGGTCG AAATCCTGCG 

351 CGGGCCGGTT ACGCTCTTGT ACAGCTCGGG CAATGTGGCG GGGCTGGTCG 

4 01 ATGTTGCCGA TGGCAAAATC CCCGAAAAAA TGCCTGAAAA CGGCGTATCG 

4 51 GGCGAACTCG GATTGCGTTT GAGCAGCGGC AATCTGGAAA AACTCACGTC 

501 CGGCGGCATC AATATCGGTT TGGGCAAAAA CTTTGTATTG CACACGGAAG 

551 GGCTGTACCG CAAATCGGGG GATTACGCCG TACCGCGTTA CCGCAATCTG 

601 AAACGCCTGC CCGACAGCCA CGCCGATTCG CAAACGGGCA GCATCGGGCT 

651 GTCTTGGGTT GGCGAAAAAG GTTTTATCGG CGTAGCGTAC AGCGACCGTC 

7 01 GCGACCAATA TGGTCTGCCT GCCCACAGCC ACGAATACGA TGATTGCCAC 

7 51 GCCGACATCA TCTGGCAAAA GAGCTTGATT AACAAACGCT ATTTACAGCT 

801 TTATCCGCAC CTGTTGACCG AAGAAGACAT CGATTACGAC AATCCGGGCT 

851 TGAGCTGCGG CTTCCACGAC GACGATAATG CACACGCACA CACCCACAGC 

901 GGCAGACCGT GGATAGACCT GCGCAACAAA CGCTACGAAC TCCGTGCCGA 

951 ATGGAAGCAA CCGTTCCCCG GTTTTGAAGC CCTGCGCGTA CACCTGAACC 

1001 G C AACG ACT A CCGCCACGAC GAAAAAGCAG GCGATGCAGT CGAAAACTTT 

1051 TTTAACAACC AAACGCAAAA CGCCCGCATC GAGTTGCGCC ACCAACCCAT 
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1101 AGGTCGTCTG 

1151 GTGCTTTATC 

1201 AACAAAGTGC 

1251 CAACTTCACG 

1301 TTCAGTACGA 

1351 CTGCCCGACC 

14 01 GGGCAACTGG 

14 51 ATCAGGAACG 

1501 GTCGCCACCA 

1551 TTCCAACAAT 

1601 ACAATCTGGC 

1651 ACCTTAAACG 

17 01 GAAGCTCGTG 

17 51 GCGAAATCTA 

1801 GACTATGTAC 

1851 AGATGCCTAC 

1901 CCCGTGTTCC 

1951 GACCGTATCG 

2001 ACTCGCCCGC 

2051 GCGCAAACTA 

2101 AAAGCCGACA 

2151 CTCTGATACG 

2201 AGTTTTAA 



AAAGGCAGCT GGGGCGTGCA 
TGCCATATCC GAAGCGGTTA 
AACATTACAG CTTTTTCGGT 
CTTGAAGGAG GCGTACGCGT 
CAAAGCATTG ATTGATCGGG 
TCGGCGCGCA CCGCCAAACC 
TATTTCACGC CACAACACAA 
CCTGCCGTCA ACGCAAGAGC 
ACACCTTTGA AGTCGGCAAC 
ATCGAACTCG CGCTGGGCTA 
ACTCTACCGC AACCGCTTCG 
ACGGACGCGG CCCCAAATCC 
CGCTACAACC AATCCGGCGC 
CTTCAAACCG ACACCGCGCT 
GAGGCCGTCT GAAAAACCTG 
GGCAACCGTC CTTTCATCGC 
GGCTGCGCGC CTCGGCTTCC 
ATGCCAATTT GGACTACTAC 
TACGAAACGC GCACGCCCGG 
CCGCCGCAAT ACGCGCTATG 
ACCTGCTCAA CCAATCCGTT 
CCGCAAATGG GCCGCAGCTT 



ATATTTACAA CAAAAATCCA 
AACAACCGAT GCTGCTTGAC 
GTAGAACAGG CAAACTGGGA 
GGAAAAACAA AAAGCCTCCA 
AAAACTACTA CAACCACCCC 
GCCCGCTCAT TCGCACTTTC 
ACTCAGCCTG ACCGCCTCCC 
TGTACGCACA CGGCAAACAC 
AAACACCTCA ACAAAGAGCG 
CGAAGGCGAC CGCTGGCAAT 
GTAACTACAT TTACGCCCAA 
ATCGAAGACG ACAGCGAAAT 
CGACTTCTAC GGCGCGGAAG 
ACCGCATCGG CGTTTCCGGC 
CCTTCCCTAC CCGGCAGAGA 
ACAGGACGAC CAAAATGCCC 
ACCTGAAAGC CTCGCTGACC 
CGCGTGTTCG CCCAAAACAA 
AC AC C AT AT G CTCAACCTCG 
GCGAGTGGAA TTGGTACGTC 
TACGCCCACA GCAGCTTTCT 
TACCGGCGGC GTGAACGTGA 



This corresponds to the amino acid sequence <SEQ ID 586; ORF 147>: 



ml47 



pep 

1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 



(partial ) 

. . PHKTEQSVDL 
GDALDGVPGI 
HAIMVDTALS 
GELGLRLSSG 
KRLPDSHADS 
ADIIWQKSLI 
GRPWIDLRNK 
FNNQTQNARI 
NKVQHYSFFG 
LPDLGAHRQT 
VATNTFEVGN 
TLNDGRGPKS 
DYVRGRLKNL 
DRIDANLDYY 
KADNLLNQSV 



ETVSVVGKSR 
HASQYGGGAS 
QQVEILRGPV 
NLEKLTSGGI 
QTGSIGLSWV 
NKRYLQLYPH 
RYELRAEWKQ 
ELRHQPIGRL 
VEQANWDNFT 
ARSFALSGNW 
KHLNKERSNN 
IEDDSEMKLV 
PSLPGREDAY 
RVFAQNKLAR 
YAHSSFLSDT 



PRATSGLLHT 
APVIRGQTGR 
TLLYSSGNVA 
NIGLGKNFVL 
GEKGFIGVAY 
LLTEEDIDYD 
PFPGFEALRV 
KGSWGVQYLQ 
LEGGVRVEKQ 
YFTPQHKLSL 
IELALGYEGD 
RYNQSGADFY 
GNRPFIAQDD 
YETRTPGHHM 
PQMGRS FTGG 



STASDKIISG 
RIKVLNHHGE 
GLVDVADGKI 
HTEGLYRKSG 
SDRRDQYGLP 
NPGLSCGFHD 
HLNRNDYRHD 
QKSSALSAIS 
KASIQYDKAL 
TASHQERLPS 
RWQYNLALYR 
GAEGEIYFKP 
QNAPRVPAAR 
LNLGANYRRN 
VNVKF* 



DTLRQKAVNL 
TGDMADFSPD 
PEKMPENGVS 
DYAVPRYRNL 
AHSHEYDDCH 
DDNAHAHTHS 
EKAGDAVENF 
EAVKQPMLLD 
IDRENYYNHP 
TQELYAHGKH 
NRFGNYIYAQ 
TPRYRIGVSG 
LGFHLKASLT 
TRYGEWNWYV 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



m!47 / gl47 92.3% identity in 142 aa overlap 



10 20 30 

ml 4 7 . pep PHKTEQSVDLETVSWGKSRPRATSGLLHTS 

1 : I I I I i I I I I J I I I I I I II I It M I I I I 
gl4 7 MRRE AKMAQI T LKP I VLS ILL I NT PLLAQAHETEQ SVG LET VSVVGKSR PRAT SGLLHTS 

10 20 30 40 50 60 



40 50 60 70 80 90 

ml 4 7 . pep TASDKI ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 

I I I I I I I I II I I I I I i I I M ! I I I I I I II I I I II I I I I I I I I II I I I I M I I I I I I I II I 
gl4 7 TAS DK 1 1 SGDTLRQKAVNLGDALDGVPG I HASQYGGGAS APVI RGQTGRR I KVLNHHGET 

70 80 90 100 110 120 



100 110 120 130 140 150 

ml 47 . pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 

I I I I 1 I I I I II M I I I I I I I I I I I I I I I I M I I I I I I I I I : I I I 

gl47 GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGAGQCCRWKNPPKNA 
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The following partial DNA sequence was identified in K meningitidis <SEQ ID 587>: 

al47 . seq 

1 ATGCGACGAG AAGCCAAAAT GGCACAAACT ACACTCAAAC CCATTGTTTT 

51 ATCAATTCTT TTAATCAACA CACCCCTCCT CTCCCAAGCG CATGGAACTG 

101 AGCAATCAGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCCACTT CGGGGCTGCT GCACACTTCT ACCGCCTCCG AC AAAAT CAT 

201 CAGCGGCGAC ACCTTGCGAC AAAAAGCCGT CAACTTGGGT GATGCTTTAG 

251 ACGGCGTACC GGGCATTCAT GCCTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC AGGCAGACGG ATTAAAGTGT TGAACCATCA 

351 CGGCGAAACG GGCGACATGG CGGACTTCTC TCCAGACCAT GCAATCATGG 

4 01 TGGACAGCGC CTTGTCGCAA CAGGTCGAAA TCCTGCGCGG TCCGGTTACG 

451 CTCTTGTACA GCTCGGGCAA TGTGGCGGGG CTGGTCGATG TTGCCGATGG 

501 CAAAATCCCC GAAAAAATGC CTGAAAACGG CGTATCGGGC GAACTCGGAT 

551 TGCGTTTGAG CAGCGGCAAT CTGGAAAAAC TCACGTCCGG CGGCATCAAT 

601 ATCGGTTTGG GCAAAAACTT TGTATTGCAC ACGGAAGGGC TGTACCGCAA 

651 ATCGGGGGAT TACGCCGTAC CGCGTTACCG CAATCTGAAA CGCCTGCCCG 

701 ACAGCCACGC CGATTCGCAA ACGGGCAGCA TCGGGCTGTC TTGGGTTGGC 

7 51 GAAAAAGGCT TTATCGGCGC AGCATACAGC GACCGTCGCG ACCAATATGG 

801 TCTGCCTGCC CACAGCCACG AATACGATGA TTGCCACGCC GACATCATCT 

851 GGCAAAAGAG TTTGATTAAC AAACGCTATT TGCAGCTTTA TCCGCACCTG 

901 TTGACCGAAG AAGACATCGA TTACGACAAT CCGGGCTTGA GCTGCGGCTT 

951 TCACGACGAC GATGATGCAC ACGCCCATGC CCACAACGGC AAACCTTGGA 

1001 TAGACCTGCG CAACAAACGC TACGAACTCC GCGCCGAATG GAAGCAACCG 

1051 TTCCCCGGTT TTGAAGCCCT GCGCGTACAC CTGAACCGCA ACGACTACCG 

1101 CCACGACGAA AAAGCAGGCG ATGCAGTAGA AAACTTTTTT AACAACCAAA 

1151 CGCAAAACGC CCGTATCGAG TTGCGCCACC AACCCATAGG CCGTCTGAAA 

12 01 GGCAGCTGGG GCGTGCAATA TTTGGGACAA AAATCCAGTG CTTTATCTGC 

1251 CACATCCGAA GCGGTCAAAC AACCGATGCT GCTTGACAAT AAAGTGCAAC 

1301 AT T AC AG CT T TTTCGGTGTA GAACAGGCAA ACTGGGACAA CTTCACGCTT 

1351 GAAGGCGGCG TACGCGTGGA AAAACAAAAA GCCTCCATCC GCTACGACAA 

14 01 AGCATTGATT GATCGGGAAA ACTACTACAA CCATCCCCTG CCCGACCTCG 

14 51 GCGCGCACCG CCAAACCGCC CGCTCATTCG CACTTTCGGG CAACTGGTAT 

1501 TTCACGCCAC AACACAAACT CAGCCTGACC GCCTCCCATC AGGAACGCCT 

1551 GCCGTCAACG CAAGAGCTGT ACGCACACGG CAAACACGTC GCCACCAACA 

1601 CCTTTGAAGT CGGCAACAAA CACCTCAACA AAGAGCGTTC CAACAATATC 

1651 GAACTCGCGC TGGGCTACGA AGGCGACCGC TGGCAATACA ATCTGGCACT 

1701 CTACCGCAAC CGCTTCGGCA ACTACATTTA CGCCCAAACC TTAAACGACG 

17 51 GACGCGGCCC CAAATCCATC GAAGACGACA GCGAAATGAA GCTCGTGCGC 

1801 TACAACCAAT CCGGTGCGGA CTTCTACGGC GCGGAAGGCG AAATCTACTT 

1851 CAAACCGACA CCGCGCTACC GCATCGGCGT TTCCGGCGAC TATGTACGAG 

1901 GCCGTCTGAA AAACCTGCCT TCCCTACCCG GCAGGGAAGA CGCCTACGGC 

1951 AACCGCCCAC TCATTGCCCA AGCCGACCAA AACGCCCCTC GCGTTCCGGC 

2001 TGCGCGCCTC GGCGTCCACC TGAAAGCCTC GCTGACCGAC CGCATCGATG 

2051 CCAATTTGGA CTACTACCGC GTGTTCGCCC AAAACAAACT CGCCCGCTAC 

2101 GAAACGCGCA CGCCCGGACA CCATATGCTC AACCTCGGCG CAAACTACCG 

2151 CCGCAATACG CGCTATGGCG AGTGGAATTG GTACGTCAAA GCCGACAACC 

2201 TGCTCAACCA ATCCGTTTAC GCCCACAGCA GCTTCCTCTC TGATACGCCG 

2251 CAAATGGGCC GCAGCTTTAC CGGCGGCGTG AACGTGAAGT TTTAA 

This corresponds to the amino acid sequence <SEQ ID 588; ORF 147. a>: 

al47 .pep 

1 MRREAKMAQT TLKPIVLSIL LINTPLLSQA HG TEQSVGLE TVSVVGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDSALSQ QVEILRGPVT 

151 LLYSSGNVAG LVDVADGKIP EKMPENGVSG ELGLRLSSGN LEKLTSGGIN 

201 IGLGKNFVLH TEGLYRKSGD YAVPRYRNLK RLPDSHADSQ TGSIGLSWVG 

251 EKGFIGAAYS DRRDQYGLPA HSHEYDDCHA DIIWQKSLIN KRYLQLYPHL 

301 LTEEDIDYDN PGLSCGFHDD DDAHAHAHNG KPWIDLRNKR YELRAEWKQP 

351 FPGFEALRVH LNRNDYRHDE KAGDAVENFF NNQTQNARIE LRHQPIGRLK 

4 01 GSWGVQYLGQ KSSALSATSE AVKQPMLLDN KVQHYSFFGV EQANWDNFTL 

4 51 EGGVRVEKQK ASIRYDKALI DRENYYNHPL PDLGAHRQTA RSFALSGNWY 

501 FTPQHKLSLT ASHQERLPST QELYAHGKHV ATNTFEVGNK HLNKERSNNI 

551 ELALGYEGDR WQYNLALYRN RFGNYIYAQT LNDGRGPKSI EDDSEMKLVR 
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601 YNQSGADFYG AEGEIYFKPT PRYRIGVSGD YVRGRLKNLP SLPGREDAYG 

651 NRPLIAQADQ NAPRVPAARL GVHLKASLTD RIDANLDYYR VFAQNKLARY 

7 01 ETRTPGHHML NLGANYRRNT RYGEWNWYVK ADNLLNQSVY AHSSFLSDTP 

7 51 QMGRSFTGGV NVKF* 



m!47/al47 



98.1% identity in 734 aa overlap 



10 20 30 

ml 4 7 . pep PHKTEQSVDLETVS WGKSRPRATSGLLHTS 

I I I I I I I I I I I I I i I I M I M I I I 1 I I I 
al4 7 MRREAKMAQTTLKPIVLSILLINTPLLSQAHGTEQSVGLETVS WGKSRPRATSGLLHTS 

10 20 30 40 50 60 

40 50 60 70 80 90 

ml47 . pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
I I I M I I M I I I I I I M I I I I I M I I I I M I I I I II I II I M I I I I 9 I 11 I I t I f M M I 
al47 TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 

70 80 90 100 110 120 

100 110 120 130 140 150 

ml47 . pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
I I I I I I I I I I I I I I I : I I I t I I I I I ! I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I 
al47 GDMADFSPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 

130 140 150 160 170 180 

160 170 180 190 200 210 

ml4 7.pep ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I 
al4 7 ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 

190 200 210 220 230 240 

220 230 240 250 260 270 

ml47 .pep TGSIGLSWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 
I I I I I I I I I I I I I I I I : I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I II I I II I I 
al4 7 TGSIGLSWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADI IWQKSLINKRYLQLYPHL 

250 260 270 280 290 300 

280 290 300 310 320 330 

ml4 7 .pep LTEEDIDYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVH 
I I 1 I If i M I I I I I I I I I I I I : I I I I : I : I : I I I I i I I M I I I I I I I II I I I II I 11 I I I 
al4 7 LTEEDIDYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVH 

310 320 330 340 350 360 

34 0 350 360 370 380 3 90 

ml 4 7 . pep LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISE 
I I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I I I I I I I 1 I 1 II 
al4 7 LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSE 

370 380 390 400 410 420 

400 410 420 430 440 450 

ml47 . pep AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPL 
I I I I I I I I I I I I M I I I I I I I I I I II i I I I I I I I I I I I I II M : I I I I I I I I I I M I II I 
al 4 7 AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPL 

430 440 450 460 470 480 

460 470 480 490 500 510 

ml 4 7 . pep PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 
I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I II M I I I I I I I 
al 4 7 PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 

490 500 510 520 530 540 



520 530 54 0 550 560 570 

ml 4 7 .pep HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR 
M I I I I M I I I I I f M M I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I f i II I I I I I I I I I I 
al4 7 HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR 



WO 99/57280 



PCT/US99/09346 



416 



550 560 570 580 590 600 

580 590 600 610 620 630 

YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQ 

| [ | | | I | I I M I I I I I I I I M I I I II I I I II I I M II I I I I M I I I 1 I I I I II : I M t I 
YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQ 

610 620 630 640 650 660 

640 650 660 670 680 690 

NAPRVPAARLGFHLKASLTDRI DANLDYYRVFAQNKLARYETRT PGHHMLNLGANYRRNT 

| | I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 1 I I I I I I M I I I I I I 

NAPRVPAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT 

670 680 690 700 710 720 

700 710 720 730 

RYGEWNW YVKADNLLNQS V Y AHS S FLS DT PQMGRS FTGG VNVKFX 

M | | I I IJ [ I I I I I I I 1 I I I I I M 11 I I I I I I I I I I I I I I I I M I 
RYGEWNWYVKADNLLNQSVYAHSS FLS DT PQMGRS FTGG VNVKFX 
730 740 750 760 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 589>: 

gl48 . seq 

1 ATGGCGTTAA AAAC AT C AAA CTTGGAACAC GCAATGctgg ttcaTCCCGA 

51 AgctATgagt gtcggcgCGC TTGccgAcaa AATCCGCAAA AtcgaAAact 

101 gGCCGCAAAA AGgcaTCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGT 

151 GCGGAATACT TCCGCCTTTT GGTCGATTTG CTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGCTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAaCGtcg gctTCGTCCC CATC CGC AAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTAcg cgcTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGa tgccgTCAAA CCCGGTTCGC 

4 01 GCGTCCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC AATGCTTGCC 

4 51 GGGCTGGAAC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAgccgccgC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGCGCAAGTG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGCAT GAAAGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 590; ORF 148.ng>: 

g!4 8 -pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHT DAVK PGSRVLLVDD LVAT GGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 59 1>: 

ml48 . seq 

1 ATGGCGTTAA AAAC AT C AAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGCGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTTCAAAGC 

151 GCGGAATACT TCCGCCTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

4 01 GCGTGCTGCT GGTCGATGAT TTGATTGCCA CGGGCGGCAC GATGCTTGCC 

4 51 GGACTGGAAC TGATCCGCAA ACTCGGCGGA GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 592; ORF 148>: 

ml 4 8 . pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 



ml 4 7 , pep 
al47 

ml 4 7 .pep 
al47 

ml 4 7 .pep 
al47 
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101 KGKLPFETVS QS YALEYGEA AVEIHTDAVK LGSRVLLVDD LIATGGTMLA 
151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml48 / gl48 99.0% identity in 199 aa overlap 

10 20 30 40 50 60 

ml 4 8 . pep MALKT SNLEHAMLVH PE AMS VGALADKI RKI ENWPQKG I LFHD I T PVLQSAE YFRLLVDL 

I M I I I I f M I I I I M I M I I I! I I I I I M I I I I i I I M I M M I I I I I M I I I I I I I I I 
g 14 8 MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 8 . pep LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQS YALEYGEA 

I I I I I I I I I i i I I II i I t It I I t I M t I I I I I I M I I I I I I I I II M I i I I I I ! I I I I I I 
g 14 8 LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 8. pep AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 
I I I I I I I I I I I I I I I I I I I I : I I II I I I I I I I i I I I I I II I I I I I I I I I I I I I I I I I I I 
g 14 8 AVEIHTDAVKPGSRVLLVDDLVATGGTMLAGLELIRKLGGEIVEAAAILE FTDLQGGKNI 

130 140 150 160 170 180 

190 200 
ml 4 8 . pep RASGAPLFTLLQNEGCMKGX 
I I I I I I I M I I I I I I I M I I 
gl 4 8 RASGAPLFTLLQNEGCMKGX 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 593>: 

al48 . seq 

1 ATGGCGTTAA AAAC AT C AAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGTGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGC 

151 GCGGAATACT TCCGACTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

4 01 GCGTGCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC GATGCTTGCC 

4 51 GGACTGGAGC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 594; ORF 148.a>: 

al48 .pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AE YFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QS YALEYGEA AVEIHTDAVK LGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

ml48/al48 99.5% identity in 199 aa overlap 

10 20 30 40 50 60 

ml 4 8 . pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 
I I I I I I I I I I I I I I I I I I I I I I M M I II I I I II I I I I I 1 I I I I J I I I I I I I I I I I I I I I 
al4 8 MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 

10 20 30 40 50 60 

70 80 90 100 110 120 
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ml 4 8 . pep LVYRYMDQKIDIVAGLDARGFI IGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 

I I i II I I M I I I I I I I I i I I I I I I I I M M I I I I 1 I I I 1 M 1 M I I I I II I I I ! I I I I M 
al4 8 LVYRYMDQKIDIVAGLDARGFI I GAALAYQLNVGFVP I RKKGKLPFETVSQS YALE YGEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml4 8 . pep AVE I HT DAVKLGSRVLLVDDL I ATGGTMLAGLEL IRKLGGE I VEAAAI LE FT D LQGGKN I 

I I I I I I I I I II I 1 I I I I I I ! t : M I I I I M I I ! I I I I I ! I I I II I I I I I 1 II I I I I I I II 
al4 8 AVE IHTDAVKLGSRVLLVDDLVATGGTMLAGLELIRKLGGE I VEAAAI LEFTDLQGGKNI 

130 140 150 160 - 170 180 

190 200 
ml 4 8 . pep RASGAPLFTLLQNEGCMKGX 
I I I I I I M I I I I I I I I I I I I 
a!4 8 RASGAPLFTLLQNEGCMKGX 

190 200 



The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 595>: 

gl49 . seq 

1 ATGT TGATTG ACAACAATGT CCGCCATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAATTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGGTAC GACAAAGCAT TGATTGATCG AGAAAACTAC 

151 TACAACCAGC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 GTTCGCACTT TCGGGCAACT GGTATTTCAC GCCACACCAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAa cgCCTGCCGT CAACGCa ag A actGtACgca 

3 01 cacggcAAGC ACGtcgccac CAACACCTTT GAagtcggca acaaACACCT 

3 51 CAACAAAGaG CgttccaacA atatcgaACT CGCGCTGGgc tAcaaaggcg 

401 accGCTGGCA ATACAATCTG GCAGCCTACC GCAACCGA t T CGGCAACTAC 

451 ATTTACGCCC AAACCTTAaa cgacggacgC GGCCCCAAAT CCATCgaaga 

501 cgacagcgaA ATGaagcTCG TGCGCTACAA CCAATCCGGT GCCGACTTCT 

551 ACGgcgcggA aggcgaaatc tACTTcaaaC CGAcACCGCG CTACCGCATC 

601 GGTGTTTCCG GCGACTatgt acgaggccgT CTGAAAAACC TGCCGTCCCT 

651 ACCCGGCAGG gaagatccCT AcggcAAACG TCccttcaTC GCACAAGCCG 

701 AC CAAAACGC CCCCCGCATT ccggctGCGC GCCTCGGCTT CCACCTGAAA 

751 ACCTCGCTAA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGTACGCCC GGACACCATA 

851 TGCTCAACCT CGGTGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGC CGA CAACCTGCtc aACcaatCcg tTTACGCCCa 

951 cAGCAGCTTC CTCTCTGATA CGCCGCAAAt gGGCCGCAGC TTtgccgGCg 

1001 gcgtaAACGT GaAGTTttaA 

This corresponds to the amino acid sequence <SEQ ID 596; ORF 149.ng>: 
gl49 -pep 

1 ML I DNNVRH Y SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNQPLPDLGA HRQTARS FAL SGNWYFTPHH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YKGDRWQYNL AAYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDPYGKRPFI AQADQNAPRI PAARLGFHLK 

251 TSLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

3 01 MWYVKADNLL NQSVYAHSSF LSDTPQMGRS F AGGVNVKF * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 597>: 

ml49 . seq 

1 ATGCTGCTTG ACAACAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG AGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATTCAGTAC GACAAAGCAT TGATTGATCG GGAAAACTAC 

151 TACAACCACC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 

351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG 

401 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGTAACTAC 
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4 51 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA 

501 CGACAG CGAA ATGAAG CTCG TGCGCTACAA CCAATCCGGC GCCGACTTCT 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

6 01 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGA GAAGATGCCT ACGGCAACCG TCCTTTCATC GCACAGGACG 

701 ACCAAAATGC CCCCCGTGTT CCGGCTGCGC GCCTCGGCTT CCACCTGAAA 

751 GCCTCGCTGA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

8 01 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

8 51 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 

951 CAGCAGCTTT CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGT TTTAA 

This corresponds to the amino acid sequence <SEQ ID 598; ORF 149>: 

ml4 9.pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIQY DKALIDRENY 

51 YNHPLPDLGA HRQTARS FAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIEIiALG YEGDRWQYNL ALYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDAYGNRPFI AQDDQNAPRV PAARLGFHLK 

251 AS LTDRI DAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

3 01 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 149 shows 95.9% identity over a 339 aa overlap with a predicted ORF (ORF 149.ng) 
from N. gonorrhoeae: 

ml49/gl49 



10 20 30 40 50 60 

ml4 9 . pep MLLDNKVQH YS FFGVEQ ANWDNFTLEGGVRVE KQKAS I Q YDKAL I DREN Y YNHP LPDLGA 

I M l-MM II MM III II I II M Ml MMMIMMIMI IMIII MM III I II 

gl4 9 MLIDNNVRHYSFFGVEQANWDNFTLEGGVRVE KQKAS IRYDKALIDRENYYNQPLPDLGA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m!4 9 . pep HRQTARSFAIiSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

1 1 M I Ml I MM IIMIMM M I M IMIIMI MM I MIM II I MM MM MM 

gl4 9 HRQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 9 . pep RSNNI ELALGYEGDRWQYNIiALYRNRFGNYI YAQTLNDGRGPKS I EDDSEMKLVRYNQSG 

I II Ml I Ml I Ml I I I I I I I I I Mill MMII MIM M lllllll I M I I I II M I 

gl49 RSNNIELALGYKGDRWQYNLAAYRNRFGNYIYAQTLNDGRGPKS I EDDSEMKLVRYNQSG 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml4 9 . pep AD FYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFI AQDDQNAPRV 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : M I I I I I I I I II -• 

gl49 ADFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRI 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 4 9 . pep PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 

MMIIMIMMMI IMMI I I MIMIMI III I I M M I II I I I I II Ml III I I II 

gl4 9 PAARLGFHLKTSLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 

250 260 270 280 290 300 



310 320 330 340 

ml4 9 . pep NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

III IIIIIMIII MM MIMIM MMMMIIIIIM 
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gl4 9 NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFAGGVNVKFX 

310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 599>: 

al49 . seq 

1 ATGCTGCTTG ACAATAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGCTAC GACAAAGCAT TGATTGATCG GGAAAACTAC 

151 TACAACC AT C CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 

351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG 

4 01 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGCAACTAC 

4 51 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA 

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGT GCGGACTTCT 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

601 GGCGTTTCCG G CG AC T AT G T ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGG GAAGACGCCT ACGGCAACCG CCCACTCATT GCCCAAGCCG 

7 01 ACCAAAACGC CCCTCGCGTT CCGGCTGCGC GCCTCGGCGT CCACCTGAAA 

7 51 GCCTCGCTGA CCGACCGCAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 

951 CAGCAGCTTC CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 600; ORF 149.a>: 

al4 9.pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNHPLPDLGA HRQTARS FAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDAYGNRPLI AQADQNAPRV PAARLGVHLK 

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

ml49/al49 98.8% identity in 339 aa overlap 

10 20 30 40 50 60 

ml 4 9 . pep MLLDNKVQHY SFFGVEQANWDNFTLEGGVRVEKQKAS I QYDKAL I DRENYYNHPLPDLGA 

i I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I : I It I I I I I I I I I I I I | | | | | [ 
al4 9 MLLDNKVQHY SFFGVEQANWDNFTLEGGVRVEKQKAS I RYDKAL I DRENYYNHPLPDLGA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 9 . pep HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 
I I M II I I I I I II I I I I I I I I I I I I | f I M I I I M M I I I II I I I I M I I I I I I M I I I I 
al 4 9 HRQTARS FALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 9 . pep RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 
I I I I I I I I I M I I I I I I I I I I M I I I M M I I M I I I I M I I I I M II M I I II I M I I I 
al 4 9 RSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 9 . pep ADFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRV 

M I I I I I II I I I I I I I I I I M If I I I I I I I I I I I I I I I I I I I I I I j M : I I I II Mill 
al4 9 ADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQNAPRV 

190 200 210 220 230 240 



ml 4 9 . pep 



250 260 270 280 290 300 

PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 
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MINI M I I I I I I I I I I I I II I I | I I I I I I j | | | If | M I I II I I I I I I I I I I It I I I 
al4 9 PAARLGVHLKASLTDRIDANLDYYRVFAQNK1ARYETRTPGHHMLNLGANYRRNTRYGEW 

250 260 270 280 290 300 

310 320 330 340 

ml 4 9 .pep NWYVKADNLLNQSVYAHSS FLS DTPQMGRS FTGGVNVKFX 

I I I f I I I I I I I I I I I IE I I I I I I I I I I I I I I I I I I I I I I I 
al4 9 NWYVKADNLLNQSVYAHSS FLS DTPQMGRS FTGGVNVKFX 

310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 601>: 

gl49-l.seq 

1 ATGGCACAAA TCACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCGAC TTCGGGGCTG 

151 CTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACTTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT GGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACGGGCAGAC GGATTAAAGT ATTGAACCAT CACGGCGAAA CGGGCGATAT 

351 GGCGGACTTT TCTCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTTGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGAAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGaagccgG ATTGCGTTTG AGCAGCGGCA 

551 ATTTAGAAAA ACTGACATCC GCAGGCATCA ATATCGGACT GGGCAAAAAC 

601 TTCGTGCTGC ATACCGAAGG CTTGTACCGC AAATCGGGCG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAT GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTGG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCGCTAC GGCCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATCA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACT TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG GCGACGGTGC 

951 ACACGCACAC ACCCACAACG GCAAACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAGC CATTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ATCTGAACCG CAATGACTAC CACCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTCT TCAACAACAA AACACACAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG CGCGCTTTCC GCCATTCCCG AAACCGTCCA 

1251 ACAACCGATG TTGATTGACA AC AATGTCC G CCATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAATT GGG AC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGGTACGAC AAAGCATTGA TTGATCGAGA 

1401 AAACTACTAC AACCAGCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCGTT CGCACTTTCG GGCAACTGGT ATTTCACGCC AC AC C AC AAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAACT 

1551 GTACGCACAC GGCAAGCACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA GCCTACCGCA ACCGATTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGT GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CGTCCCTACC CGGCAGGGAA GATCCCTACG GCAAACGTCC CTTCATCGCA 

1951 CAAGCCGACC AAAACGCCCC CCGCATTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAACC TCGCTAACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG TACGCCCGGA 

2101 CACCATATGC TCAACCTCGG TGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TC T GAT AC GC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TAAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 602; ORF 149-l.ng>: 

gl49-l.pep 

1 MAQITLKPIV LSILLINTPL LAQA HETEQS VGLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGEAGLRL SSGNLEKLTS AGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYS DRRDRY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDGDGAHAH THNGKPWIDL RNKRYELRAE WKQPFPGFEA 
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351 LRVHLNRNDY HHDEKAGDAV ENFFNNKTHN ARIELRHQPI GRLKGSWGVQ 
401 YLGQKSSALS AIPETVQQPM LIDNNVRHYS FFGVEQANWD NFTLEGGVRV 
451 EKQKASIRYD KALIDRENYY NQPLPDLGAH RQTARSFALS GNWYFTPHHK 
501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 
551 EGDRWQYNLA AYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 
601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DPYGKRPFIA 
651 QADQNAPRIP AARLGFHLKT SLTDRIDANL DYYRVFAQNK LARYETRTPG 
701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 
751 TGGVNVKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 603>: 

ml 4 9-1 , seq 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGATTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC GTCGGGGCTG 

151 TTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACCTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT AGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCGTCTG CTCCCGTCAT TCGCGGTCAA 

301 ACAGGCAGGC GGATTAAAGT GTTGAACCAT CACGGCGAAA CAGGCGATAT 

351 GGCGGATTTT TCGCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTCGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG TTTTATCGGC 

751 GTAGCGTACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGCTTGATTA 

851 ACAAACGCTA TTTACAGCTT TATCCGCACC TGTTGACCGA AG AAG AC AT C 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG ACGATAATGC 

951 ACACGCACAC ACCCACAGCG GCAGACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGTGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 

1101 CGATGCAGTC GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGCATCG 

1151 AGTTGCGCCA CCAACCCATA GGTCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTACAAC AAAAATCCAG TGCTTTATCT GCCATATCCG AAGCGGTTAA 

1251 ACAACCGATG CTGCTTGACA ACAAAGTGCA ACATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGAGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT TCAGTACGAC AAAGCATTGA TTGATCGGGA 

1401 AAACTACTAC AACCACCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

1701 TAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGCGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CTTCCCTACC CGGCAGAGAA GATGCCTACG GCAACCGTCC TTTCATCGCA 

1951 CAGGACGACC AAAATGCCCC CCGTGTTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAGCC TCGCTGACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 

2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTTCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 604; ORF 149-1 >: 

ml49-l .pop 

1 MAQTTLKPIV LSILLINTPL LAQA HETEQS VDLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 VAYSDRRDQY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDNAHAH THSGRPWIDL RNKRYELRAE WKQPFPGFEA 

351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

4 01 YLQQKSSALS AISEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIQYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 
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551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE 

651 QDDQNAPRVP AARLGFHLKA SLTDRIDANL DYYRVFAQNK 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL 

751 TGGVNVKF* 

ml49-l/gl49-l 96.2% identity in 758 aa overlap 

10 20 30 40 50 60 

ml4 9-l.pep MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 
Ml I I I I ! I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
g 1 4 9 - 1 MAQI TLKPI VLS I LL I NT PLLAQAHETEQSVGLETVS WGKS RPRAT S GLLHTSTAS DKI 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 9-1. pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I II I I I II I I I I I I I t 
gl 4 9- 1 ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 9-1. pep SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
I i I I I I M I I I I I I I I I I I I I I I I i I I I I I I I 1 I I I I I ! I I I I I I I I I I I I I I I I I I I I 
gl49-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGEAGLRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml49-l.pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
I | | | || I | | I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I I I 
gl4 9-l SSGNLEKLTSAGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml49-l.pep SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 
I I U I I I I II : I I I I I I I : I I I I I I I I I I I I I I I I II I I I i I I I ! I I i I I I I I ! I I I I I I 
gl49-l SWVGEKGFIGAAYSDRRDRYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml49-l.pep DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
I I I I I I I I I I I I I I : I I I I I I : I : I I I I M I M I I I I I ! I I I M I I I i i I I I I I I I I I I 
g 1 4 9 - 1 DYDNPGLSCGFHDGDGAHAHTHNGKPWI DLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml49-l .pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 
: 1 I I I I I I I I I I I I I I : I : I I I I I I I I I II I I II I I I I I I II I I I I i I I I I I : I : I It 
g 1 4 9- 1 HHDEKAGDAVENFFNNKTHNARI ELRHQPI GRLKGS WGVQYLGQKS SALS AI PETVQQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 4 9-1 . pep LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGAH 
1 : I j : I : I I I I I I I I I I I I I I j I i I f I I 1 I I I I I ! M : I I It I t I I I I I I I : I I I I I I I I 
g 14 9-1 LIDNNVRHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNQPLPDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 4 9-1 . pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 
I I I I I I I I i M I I I I I I : I I I I I 1 I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
g 14 9-1 RQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 4 9-1. pep SNNIELALGYEGDRWQYNLALYRNRFGNYI YAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
I I I I I I I I I I I I I I E II I 1 I I I I I I I I I I I I I I I I I I I 1 I 1 I I I It I I I I I I I I I I I I I 
gl4 9-l SNNIELALGYEGDRWQYNLAAYRNRFGNYI YAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 4 9-1. pep DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRVP 
I I I I I I ! I I I I I I I I I E I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I t I I II I I I : I 
gl4 9-l DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRIP 

610 620 630 640 650 660 



KLVRYNQSGA 
DAYGNRPFIA 
LARYETRTPG 
SDTPQMGRSF 
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670 680 690 700 710 720 

ml 4 9-1 . pep AARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 
I I I I I I I I I : I I I I I 1 I I i I I I I i I I I I I I I I I I I I I t I I I 1 ! I I I I I I I I I I I I I I I I I 
gl4 9-l AARLGFHLKTSLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 

670 680 690 700 710 720 

730 740 750 759 

ml 4 9- 1 . pep WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
I I I I I I I I I I I I I I I I I I I II I I I 1 I I I I I I I I I I I I I I 
gl49-l WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

730 740 750 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 605>: 

al49-l . seq 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCTCCCAAG CGCATGGAAC TGAGCAATCA GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC TTCGGGGCTG 

151 CTGCACACTT CTACCGCCTC CGACAAAATC ATCAGCGGCG ACACCTTGCG 

201 ACAAAAAGCC GTCAACTTGG GTGATGCTTT AGACGGCGTA CCGGGCATTC 

251 ATGCCTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACAGGCAGAC GGATTAAAGT GTTGAACCAT CACGGCGAAA CGGGCGACAT 

351 GGCGGACTTC TCTCCAGACC ATGCAATCAT GGTGGACAGC GCCTTGTCGC 

4 01 AACAGGTCGA AATCCTGCGC GGTCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATTA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTTCACGACG ACGATGATGC 

951 ACACGCCCAT GCCCACAACG GCAAACCTTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG TGCTTTATCT GCCACATCCG AAGCGGTCAA 

1251 ACAACCGATG CTGCTTGACA ATAAAGTGCA AC ATT AC AG C TTTTTCGGTG 

1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGCTACGAC AAAGCATTGA TTGATCGGGA 

1401 AAACTACTAC AACCATCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

14 51 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCG 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CTTCCCTACC CGGCAGGGAA GACGCCTACG GCAACCGCCC ACTCATTGCC 

1951 CAAGCCGACC AAAACGCCCC TCGCGTTCCG GCTGCGCGCC TCGGCGTCCA 

2001 CCTGAAAGCC TCGCTGACCG ACCGCATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 

2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 606; ORF 149-l.a>: 

al49-l .pep 

1 MAQTTLKPIV LSILLINTPL LSQA HGTEQS VGLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDS ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYSDRRDQY GLPAHSHEYD DCHADI IWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDDAHAH AHNGKPWIDL RNKRYELRAE WKQPFPGFEA 
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351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

401 YLGQKSSALS ATSEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIRYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 

551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPLIA 

651 QADQNAPRVP AARLGVHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

al49-l/ml49-l 98.0% identity in 758 aa overlap 

10 20 30 40 50 60 

al 4 9-1 .pep MAQTTLKPIVLSILLINTPLLSQAHGTEQSVGLETVSVVGKSRPRATSGLLHTSTASDKI 
I I ! I I I i I I I I i I I I I I I I ! I : 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ( t I 
ml 4 9-1 MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 

10 20 . 30 40 50 60 

70 80 90 100 110 120 

al 4 9-1 .pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
I M I I I I I I I I I I I I i I I I I I I I I I I I I I i I I I M I I I I I I I I I I I I I I I I I I f I I I I I I 
ml 4 9-1 ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

al 4 9-1 .pep SPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
I I I I I I I I I : I I I I I I i I I I I I I M I I I I I I I i I i I I i I I I I I I I I I I I M I I I i I I I I I 
ml49-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

al4 9-l . pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
I I I M 11 I I I I I M I I I I I I I i I I I I I I I I I I f I I 1 I I I | I | I I I I I | | | I | | | | | | | | | 
ml 4 9-1 SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

al 4 9-1. pep SWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 
I I I I I E I I I I : I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I 1 ! I I I I I ! 
ml 4 9-1 SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

al4 9-1 . pep DYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
I I I I I I I I I I I ! I I I : I I I I : I : I : I I I I i I I I I I I I 1 I I I I I I I II I I I I I I I i I I I I I 
ml49-l DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

al49-l . pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSEAVKQPM 
I i I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
ml 4 9-1 RH D E K AG DA VEN F FNNQT QN AR I EL RH Q P I GRL KG S WG VQ YLQQKS SAL S A I S E A V KQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

al49-l . pep LLDNKVQHYS FFGVEQANWDNFTLEGGVRVEKQKAS I RYDKALIDRENYYNHPLPDLGAH 
f I I I I I 1 I M I I I II I I I I I I I M I I I I I 1 I I I I I I I : I I I I I I I i I I I I I I I I I I ! I I I 
ml 4 9-1 LLDNKVQH Y S FFGVEQANWDNFTLEGGVRVEKQKAS I QYDKAL I DRENYYNH PL PDLG AH 

430 440 450 460 470 480 

490 500 510 520 530 540 

al4 9-l .pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFE VGNKHLNKER 
I I I I I I I I I t I I I I I f I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I II I I I I I I I 
ml 4 9-1 RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

al4 9-l . pep SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 

I I I I I I i I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

ml 4 9-1 SNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
550 560 570 580 590 600 

610 620 630 640 650 660 
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al4 9-l.pep DFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQNAPRVP 
I M I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I I I I I II I 
ml49-l DFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRVP 

610 620 630 640 650 660 

670 680 690 700 710 720 

a 14 9-1. pep AARLGVHLKASLTDRI DANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 

mil 1 1 1 1 1 1 1 1 n 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 ii 1 1 1 1 1 M 1 1 1 1 

ml49-l AARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 
670 680 690 700 710 720 

730 740 750 759 

al49-l .pep WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
I M I I I I I I I I I I I I I I I I I I I I I I I I j I I I I I I M I I I 
ml4 9-l WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

730 740 750 

The following partial DNA sequence was identified in N.gonorrhoeae <SEQ ID 607>: 

glBO.seq (partial) 

1 . . TACTGCAAGG CAGACCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT 

51 CACCGCCCGC CAATCCGATA AAGACGTGCG CCACATCGAA AT CGAT TTGA 

101 GCGGTTCGGA TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT 

151 GACAACGATC CGGCACTGGT CGGGGAAATC CTAGACCTGC TCGGCATCAA 

2 01 TCCGGCAACG GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG 
251 CACTGTTATC CCATTTCGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA 
301 GGCTATGCCA CGTTCGCCGA TAATGACGAA CTCGACCGTA TTGCTGCCGA 
351 CAACGCCGTT TTGCAAGGCT TTGTGCAAAG CACG CCGATT GCCGGTGTGC 
401 TGCACCGCTT CCCGGCAAAA CTGACGG CGG AACAATTCGC CGGCCTGCTG 
451 CGCCCGCTTG CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGC 
501 GGGGGACGAA GTGCACCTGA CCGTCGGCGC AGTGCGTTTC GAACACGAAG 
551 GGCGCGCCAG GGCGGGCGGC GCATCGGGTT TCTTTGCCGA CCGGCTGGAA 
601 GAGGACGGCA CGGTGCGCGT GTTTGCGGAA CGCAACGACG GCTTCAGGCT 
651 GCCCGAAGAC AGCCGCAAGC CGATTGTGAT GATCGGCTCC GGTACCGGCG 
701 TCGCACCGTT CCGCGCCTTC GTCCAACAAC GTGCCGCAGA AAATG CGGAA 
751 GGCAGAAACT GGCTGATTTT CGGCAATCCG CATTTTGCCG CCGACTTCCT 
801 CTATCAGACC GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTG CACAGAT 
851 ATGACTTCGC CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC 
901 AAAATCCGCG AACAGGCGGA AGGAC TTTGG CAATGGCTGC AGGAAGGCGC 
951 GCATATCTAT GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GAAGTGGAAG 

1001 CCGCCTTGCT GGATGTGATT ATCGGGGCAG GGCATTCGGA CGAAGACGGC 

1051 GCAGAAGGAT ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA 
1101 TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 608; ORF 150.ng>: 

glSO.pep (partial) 

1 ..YCKADPFPAA LLANQKI TAR QSDKDVRHIE IDLSGSDLHY L PGD ALGVW F 

51 DNDPALVGEI LDLLGINPAT EIQAGGKTLP VASALLSHFE LTQNTPAFVK 

101 GYATFADNDE LDRIAADNAV LQGFVQSTP I AGVLHRFPAK LTAEQFAGLL 

151 RPLAPRLYSI SSSQAEAGDE VHLTVGAVRF EHEGRARAGG ASGFFADRLE 

201 EDGTVRVFAE RNDGFRLPED SRKPIVMIGS GTGVAPFRAF VQQRAAENAE 

251 GRNWLIFGNP HFAADFLYQT EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD 

301 KIREQAEGLW QWLQEGAHIY VCGDAAKMAK EVEAALLDVI IGAGHSDEDG 

351 AEGYLDMLRE EKRYQRDVY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 609>: 

mlSO.seq 

1 ATG CAGAAC A CAAATCCGCC ATTACCGCCT CTGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AGCAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGGCGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AATGCGAAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAACATCG CCGGCGAACG CCGCCTGCTG CTGGTTACCT CCACCCAAGG 

3 51 CGAAGGCGAA CCG CCGAAAG AAGCCGTCGT GCTGCACAAA CTG CTGAACG 
401 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 
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451 GGCGACAGTT CCTATCCGAA TTTCTGTCAG GCAGGTAAAG ATTTCGACCG 

5 01 GCGTTTTGAA GAATTGGGCG CAAAACGGCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TACCGCCTCC GCAAACGCCT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

701 CAGCCCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCCGGCAACG 

901 GAAATACAGG CGGGCGGAAA GATGATGCCG GTTGCGCGCG CACTTTCATC 

951 TCATTTCGAA CTCACGCAAA ACACTCCGGC TTTCGTCAAA GGCTATGCCG 

10 01 CGTTCGCCCA TTATGAAGAA CTCGATAAAA TCATTGCCGA TAACGCCGTT 

1051 TTGCAGGATT TCGTGCAAAA CACGCCTATT GTCGATGTGC TGCACCGCTT 

1101 CCCGGCAAGC CTGACGGCAG AACAATTCAT CCGTTTACTG CGTCCGCTTG 

1151 CACCCCGTTT GTATTCGATT TCTTCAGCAC AGGCGGAAGT GGGCGATGAA 

1201 GTGCATTTAA CTGTCGGCGT GGTTCGTTTT GAACACGAAG GCCGCGCCAG 

12 51 AACGGG CGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

13 01 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC 

13 51 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

14 01 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 
14 51 GGCTGATTTT CGGCAATCCG C ATTTTGC C C GTGATTTTCT CTATCAAACC 
1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGGT ACGATTTCGC 
1551 CTGGTCCCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 
1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 

16 51 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
1701 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 

17 51 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 610; ORF 150>: 
mlSO.pep 

1 MQNTNPPLPP LPPEITQLLiS GLDAAQWAWL SGYAWAKAGN GASAGLPALQ 

51 TALPAAEPFS VTVLSASQTG NAKSVADKAA DSLEAAGIQV SRAELKDYKA 

101 KNIAGERRLL LVTSTQGEGE PPKEAWLHK LLNGKKAPKL DKLQFAVLGL 

151 GDSSYPNFCQ AGKDFDRRFE ELGAKRLLER VDADLDFTAS ANAWTDNIAA 

2 01 LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR YCKAAP FPAA LLANQKITAR 
251 QSDKDVRHIE IDLSGSDLHY LPGDALGVWF DNDPALVREI LDLLGIDPAT 
301 EIQAGGKMMP VARALSSHFE LTQNTPAFVK GYAAFAHYEE LDKIIADNAV 

3 51 LQDFVQNTP I VDVLHRFPAS LTAEQFIRLL RPLAPRLYSI SSAQAEVGDE 

4 01 VHLTVGWRF EHEGRARTGG ASGFLADRLE EDGTVRVFVE RNDGFRLPED 
4 51 SRKPIVMIGS GTGVAP FRAF VQQRAAENAE GKNWLIFGNP HFARDFLYQT 
501 EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD KIREQAEGLW QWLQEGAHIY 
551 VCGDAAKMAK DVEAALLDVI I GAGHLiDE EG AEEYLDMLRE EKRYQRDVY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 150 shows 91.3% identity over a 369 aa overlap with a predicted ORF (ORF 150.ng) 
from TV. gonorrhoeae: 

ml50/gl50 

210 220 230 240 250 260 

LLKEEAAKNRAT PAPQTTPPAGLQTAPDGRYCKAAPFPAALLANQKITARQSDKDVRHIE 

till I I I I I I I I I I I I I I I I I I I I I II I I 
YCKADPFPAALLANQKITARQSDKDVRHIE 
10 20 30 

270 280 290 300 310 320 

IDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPATEIQAGGKMMPVARALSSHFE 
I I I i M I I I I I I I I I I I I I I I II I I I I I I I j I I I I : I I I I II I I I I : II I II I I I I 
IDLSGSDLHYLPGDALGVWFDNDPALVGEILDLLGINPATEIQAGGKTLPVASALLSHFE 
40 50 60 70 80 90 



ml50 .pep 
gl50 

ml 50 . pep 
gl50 



ml50 . pep 



330 340 350 360 370 380 

LTQNTPAEVKGYAAFAHYEELDKI I ADNAVLQDFVQNTPI VDVLHRFPAS LTAEQFIRLL 
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I M I I I I M I I I I : I I : I M : I I I I t i 1 i I I I : I I I : I I I ! I I I : I I I I I I II 
gl50 LTQNTPAFVKGYATFADNDELDRIAADNAVLQGFVQSTPIAGVLHRFPAKLTAEQFAGLL 

100 110 120 130 140 150 

390 400 410 420 430 440 

ml50 . pep RPLAPRLYSISSAQAEVGDEVHLTVGWRFEHEGRARTGGASGFLADRLEEDGTVRVFVE 
I M I t I I I I I I I : I I I : I I I I I M I I : It I I I I I I I I : I M I I I :! II I I I I I I M I I : I 
gl50 RPLAPRLYSISSSQAEAGDEVHLTVGAVRFEHEGRARAGGASGFFADRLEEDGTVRVFAE 

160 170 180 190 200 210 

450 460 470 480 490 500 

ml50 . pep RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAEGKNWLIFGNPHFARDFLYQT 
I M I E M I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I : I 1 I I I I I I I I I I I I I I I 
gl50 RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAEGRNWLIFGNPHFAADFLYQT 

220 230 240 250 260 270 

510 520 530 540 550 560 

ml50 . pep EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMAK 
I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I 
gl50 EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMAK 

280 290 300 310 320 330 

570 580 590 600 

ml 50 . pep DVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 
: I I I I I I I I I I I M I II : I I I I II I I I I M I I I 11 I I I 
gl50 EVEAALLDVIIGAGHSDEDGAEGYLDMLREEKRYQRDVYX 

340 350 360 370 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 61 1>: 

al50.seq 

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT ATGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AGCAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGACGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AATGCGAAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAAC AT C G CCGGCGAACG CCGCCTGCTG CTGGTTACCT CCACCCAAGG 

351 CGAAGGCGAA CCGCCGGAAG AAGCCGTCGT GCTGCACAAA CTGCTGAACG 

4 01 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 

451 GGCGACAGCT CCTATCCGAA TTTCTGCCGG GCGGGCAAAG ATTTCGACAA 

501 ACGTTTTGAA GAATTGGGCG CAAAACGCCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TGCCGCCGCC GCAGACGGAT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

701 CAGACCCCTT TCCCGCCGCC CTGCTGGCCA AT C AG AAAAT CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCAGGCAACG 

901 GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG CACTGTTATC 

951 CCATTTTGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA GGCTATGCCC 

1001 CGTTCGCCGA TGAT GACGAA CTCGACCGTA TTGCTGCCGA CAACGCCGTT 

1051 TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGATGTGC TGCACCGCTT 

1101 CCCGGCAAAA CTGACAGCGG AACAATTCGC CGGCCTACTG CGCCCGCTTG 

1151 CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGT GGGGGACGAA 

1201 GTGCACCTGA CCGTCGGCGC GGTGCGTTTC GAACACGAAG GGCGCGCCAG 

1251 GGCGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

1301 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC 

1351 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

14 01 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 

14 51 GGCTGTTTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC 

1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGAT ACGATTTCGC 

1551 CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 

1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 

1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
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17 01 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 
17 51 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 



This corresponds to the amino acid sequence <SEQ ID 612; ORF 150.a> 

alSO.pep 

MQNTNPPLPP MPPEITQLLS GLDAAQWAWL SGYAWAKAGN 

TALPTAEPFS VTVLSASQTG NAKSVADKAA 
KNIAGERRLL LVTSTQGEGE 

GDSSYPNFCR AGKDFDKRFE ELGAKRLLER VDADLDFAAA 

LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR YCKADPFPAA 

QSDKDVRHIE IDLSGSDLHY LPGDALGVWF DNDPALVREI 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



DSLEAAGIQV 
PPEEAWLHK LLNGKKAPKL 



EIQAGGKTLP VASALLSHFE LTQNTPAFVK GYAPFADDDE 
LQGFVQSTPI ADVLHRFPAK LTAEQFAGLL RPLAPRLYSI 
VHLTVGAVRF EHEGRARAGG ASGFLADRLE EDGTVRVFVE 
SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLFFGNP 
EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD KIREQAEGLW 
VCGDAAKMAK DVEAALLDVI IGAGHLDEEG AEEYLDMLRE 



GASAGLPALQ 
SRAELKDYKA 
DKLQFAVLGL 
ADGWTDNIAA 
LLANQKITAR 
LDLLGIDQAT 
LDRIAADNAV 
SSSQAEVGDE 
RNDGFRLPED 
HFARDFLYQT 
QWLQEGAHIY 
EKRYQRDVY* 



m!50/al50 



94.8% identity in 599 aa overlap 
10 



20 



30 



40 



50 



60 



ml 50 . pep MQNTNPPLPPLPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPAAEPFS 
M ! I I t I I I I : I I I I I ! I I I I I I M ! I t 1 I I t f I I I I I I I I I I I I I I 1 I I I I !!: I I I I I 
a 150 MQNTNPPLPPMPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPTAEPFS 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml 50 . pep VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGERRLLLVTSTQGEGE 
I I! I I I I I I I I I I I I I I I I I I I I I I I i I I I M I i II I I i I I I I I I I I I I I I I I I I I I t I I 
al50 VTVL S AS QTGNAKS VA DKAADS L EAAG I QVSRA ELK DYKAKN I AGE RRLL LVTSTQGEGE 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 50 . pep PPKEAWLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCQAGKDFDRRFEELGAKRLLER 
I I : I I I I I I I I I I I I I I I I I i I M I I I I I I I I I I I I I I I : I I I I I I : I I I I I I I I I I I I I 
al50 PPEEAVVLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCRAGKDFDKRFEELGAKRLLER 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 5 0 . pep VDADLDFTASANAWTDNI AALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAA 

I I I I I I I : I : I : : I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I 
al50 VDADLDFAAAADGWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKADPFPAA 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 50 . pep LLANQKITARQSDKDVRHIEIDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPAT 
I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I t I t I I I M I I I I I I I M I I I I I 1 II 
al50 LLANQKITARQSDKDVRHIE I DLSGSDLHYLPGDALGVWFDNDPALVREI LDLLGIDQAT 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 50 . pep EIQAGGKMMPVARALSSHFELTQNTPAFVKGYAAFAHYEELDKIIADNAVLQDFVQNTPI 
I I I I I I I : I I I II I I I I I I I I I I I I I I I I I i I : I I I : I I II I I I I I I I : I I I 
a 150 EIQAGGKTLPVASALLSHFELTQNTPAFVKGYAPFADDDELDRIAADNAVLQGFVQSTPI 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 50 .pep VDVLHRFPASLTAEQFIRLLRPLAPRLYSISSAQAEVGDEVHLTVGVVRFEHEGRARTGG 
: I I I I I I I I : 1 I I I I I I I I M I I I I I I I I I : I I I I I I I I 1 I I I I : II I I I II I I I : I I 
al50 ADVLHRFPAKLTAEQFAGLLRPLAPRLYSISSSQAEVGDEVHLTVGAVRFEHEGRARAGG 
370 380 390 400 410 420 



430 440 450 460 470 480 

ml 50 . pep ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE 
I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
al50 ASGFLADRLEEDGTVRVFVERNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE 

430 440 450 460 470 480 



ml50 . pep 



490 500 510 520 530 540 

GKNWLIFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
I I I I I : I I II I I I I M I I I I I I I I I I I I I I it I I I I I I I I I I I I I I I I I It I I I I I I I I I 
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al50 GKNWLFFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
490 500 510 520 530 540 

550 560 570 580 590 600 

ml 50 . pep QWLQEGAHIYVCGDAAKMAKDVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 
I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 t I I I I I I I t I I I I I I I I I I I 
a 1 5 0 QWLQEGAH I Y VCGDAAKMAKDVEAALLD V 1 1 GAGHLDEEG AEE YLDMLREEKRYQRDVYX 

550 560 570 580 590 600 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 61 3>: 

glSl . seq 

1 ATGAAAC7AAA TCCGCAACAT CGCCATCATC GCACACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTG 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TGGAGCGCGT TTTGGGGATG GTGGATTGCG TCGTCTTGTT GGTGGACGCA 

301 CAGGAAGGTC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAACCGTCCG 

401 CCCGTCCGAG CTGGGTTATC GACCAGACTT TCGAGTTGTT CGACAACTTG 

451 GGTGCGACCG ACGAGCAGTT GGATTTCCCG ATTGT TTACG CTTCAGGTTT 

501 GAGCGGCTTT GCCAAGCTGG AAGAAAccga CGAGAGCAGC GATATGCGCC 

551 CGCtgttcgA CACCATCCTA AAATACAcgc ctgCACCGAG CGGCAGCGCG 

601 GACGAGCCGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCT TGAA CGGACGCATC AAACCCGGCC 

701 AAACCGTTGC CGTGATGAAC CACGAGCAGC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGCTTGGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATTGTGA TTATTTCCGG T AT CGAAGAC ATCGGCATCG 

851 GCGTAAC CAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTAAACA CCAGCCCGCT 

951 CGCAGGTACA GAAGGCAAAT TCGTGACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTTGCCC TGCGCGTGGA AGACACCGCC 

1051 GatgCCGACG TGTTCCGCGT ATCcgGGCGC GGCGAACTGC ACCTGACGAT 

1101 TTTGCTGGAA AATATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAGC 

1151 CGCGCGTCGT GTACCGAGAC ATCGACGGTC AAAAATGCGA ACCTTATGAA 

1201 AACCTGACTG TGGACGTACC cgacgacAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACCAA TATGGAAAGC GACGGCAACG 

13 01 GacgCACCCG CCT CGAATAC CATATTC C AG CGCGCGGCTT GATCGGTTTC 
1351 CAAGGCGAAT TCATGAC CCT GACGCGCGGC GTCGGGCTGA TGAgccacGT 

14 01 GTTcgacgac tacgcgcccg t caAACCCGA TATGCCCGGC CGCCACAACG 
1451 GCGTactggt GtCCcaAGAG CAGGGCGAGG CGGTTGCTTA CGCCTTGTGG 
1501 AATCTTGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 
1551 CGAAGGTATG ATTATCGGCA TCCACAGCCG CGACAACGAT TTGGTGGTCA 
1601 ACCCGCTCAA AGGCAAAAAA CTCACCAATA TCCGTGCCAG CGGTACCGAC 
1651 GAAG CGGTGC GCCTGACCAC GCCGATCAAA CTGAcgcTGG AAGGCGCGGT 
1701 CGAGTTTATC GACGATGACG AGCTGGTGGA AATCACGCCG CAAtccatCC 
1751 gcctgcgcat gcgttacctG AGCGaattgg aacgccgccg tcaTTTTAAA 
1801 AagctgGATT AA 

This corresponds to the amino acid sequence <SEQ ID 614; ORF 151.ng>: 

gl51 .pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGCHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALAXiGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESS DMRPLFDTIL KYTPAPSGSA 

201 DEPLQLQ I SQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HEQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTITD K DNPKGLPMLS 

3 01 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 
351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

4 01 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 
451 QGEFMTLTRG VGLMS HVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 
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501 NLEDRGRMPV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRMRYL SELERRRHFK 

601 KLD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 615>: 

ml51 . seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACG AC CT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGCGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

3 01 CAGGAAGGCC CGATGCCG CA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

3 51 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAGCCGTCCG 

4 01 CTCGTCCGAG CTGGGTTATC GACCAAACTT TCGAGCTGTT CGACAACTTG 
4 51 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGGTT 
501 GAGCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 
551 CGCTGTTCGA TACTATCTTA AAATATACGC CTG CACCGAG CGGCAGCGCG 
6 01 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 
651 CGGCCGCCTC GGTATCGGTC GTAT CTTGAA CGGACGCATC AAACCCGGCC 
701 AAACCGTTGC CGTCATGAAC CACGATCAGC AAATCGCCCA AGGCCGCATC 
751 AACCAGCTTT TGGGTTTCAA AGGTTTGGAA CGCGTGCCGC TTGAAGAAGC 
801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATCGAAGAC ATCGGTATCG 
851 GCGTAACCAT CACCGACAAA GACAATCCCA AAGGCCTACC GATGTTGAGC 
901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGCT 
951 GGCGGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATAC CGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA AC CGT ATGAA 

1201 AACCTGACCG TGGATGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

13 01 GACGCACCCG C CTCGAAT AC CATATTCCAG CGCGCGGCTT GATCGGTTTC 

13 51 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

14 01 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCCGGC CGCCACAACG 
14 51 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 
1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 
1551 CGAAGGCATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 
1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 
1651 GAAGCCGTTC GCCTGACCAC GCCAATCAAG CTGACGCTGG AAGGTGCGGT 
1701 TGAGTTTATC GACGATGACG AACTCGTTGA AATCACGCCG CAATCCATCC 
1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG C C ACTTT AAA 
1801 AAGCTGGATT GA 

This corresponds to the amino acid sequence <SEQ ID 616; ORF 151>: 

ml51 -pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED 1GIGVTIT DK DNPKGLPMLS 

3 01 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

3 51 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

4 01 NLTVDV PDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 
451 QGE FMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 
501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 
551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK 
601 KLD* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 151 shows 99.2% identity over a 603 aa overlap with a predicted ORF (ORF 151.ng) 
from N. gonorrhoeae: 
ml51/gl51 

10 20 30 40 50 60 

ml51 .pep MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 
! I I I II I I M I I I I M I 1 I M I I I I I I! I I I I I I I t I I I I I M I I I I I I I I I I I I I I I M 
gl51 MKQIRNIAIIAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml51 . pep AIDYEGYHINIVDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 
I I I I I I I I I I I I I II I U I I I I I II I I I I I I I I I I I I I I I I I I I II M 1 I M I I I I I I I 
gl51 AIDYEGCHINIVDT PGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

mlSl.pep KPIWINKIDKPSARPSWVIDQT FELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 

I I I I I I I I I I I I I I I I It M I I I I II I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I : 
g 151 KPIWINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml51 . pep DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 

II I I I I I I I I I I I I I I I I I II I I II I M I I I I I I I I I I I I M I I M It I I I I M I M I I 
g 151 DMRPLFDTILKYTPAPSGSADEPLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml51 .pep HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 

I : I I I I II I 1 I I I I I I M II I I I M I I it I I I I II M I I 1 I I i I I M M I I I I M I M I I 
g 151 HEQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml51 . pep VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 
I I I I I I M I 1 I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I ! I I M I I I I I M I I I I I I 
gl51 VDEPTLTMDFiyrVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml51 . pep GELHLTILLENMRREGYELAVGKPRVVYRDIDGQKCEPYENLTVDVPDDNQGAVMEELGR 
M | | I | I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I II I I It I I I I I I I I I I I I 
g 151 GELHLTILLENMRREGYELAVGKPRWYRDIDGQKCEPYENLTVDVPDDNQGAVMEELGR 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml51 . pep RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 
I I M I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I 
gl51 RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml51 .pep RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMI IGIHSRDNDLWNPLKGKK 

I I I M I I I I I I I I I I II M I I I I I I II I I I M I I I t I I I I I I I I I I I II I I I I I I M I M 
gl51 RHNGVLVSQEQGEAVAYALWNLE DRGRMFVSPNDKIYEGMI IGIHSRDNDLWNPLKGKK 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml51 . pep LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 
I I I I I I I I I I I II I M I I I I I I I I II I II I I I I I I I I I I I I I I 1 II I I I I I I I I I I II I 
gl51 LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRMRYLSELERRRHFK 

550 560 570 580 590 600 
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mlSl.pep KLDX 
I I I I 

gl51 KLDX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 617>: 

al51 . seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AAC C AC AT T G GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC C AAAAAC AC C GCCATCGATT ACGAAGGCTA 

201 C C AC AT CAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

2 51 TAG AG CG AGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

301 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA TAAAATCGAC AAACCGTCCG 

4 01 CCCGTCCGAG CTGGGTCATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

451 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTATG CTTCCGGTCT 

501 GTCCGGTTTC GCCAAATTGG AAGAAACCGA C GAG AG C AAC GACATGCGTC 

551 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 

601 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGTATC AAGCCCGGTC 

701 AAGTTGTTGC CGTCATGAAC CACGATCAAC AAATCGCCCA AGGCCGCATC 

751 AACCAGCTTT TGGGTTTCAA AGGTTTAGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC G AC AT CGTG A TTATTTCCGG TATTGAAGAC ATCGGCATCG 

851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGTT 

951 GGCAGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGACGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

1301 GACGCACCCG CCTCGAATAC C AT ATT CC AG CGCGCGGCTT GATCGGCTTC 

1351 CAAGGCGAAT T TAT G AC C CT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

14 01 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCTGGC CGCCACAACG 

14 51 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 

1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGTATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCCGTTC GCCTGACCAC GCCGATTAAG CTGACGCTGG AAGGTGCGGT 

17 01 CGAGTTTATC GACGATGATG AGCTGGTAGA AATCACGCCG CAATCCATCC 

1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCATTTCAAA 

1801 AAGCTAGATT GA 

This corresponds to the amino acid sequence <SEQ ID 618; ORF 15La>: 

al51 .pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQVVAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRVVYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

451 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 

501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK 

601 KLD* 

ml 5 1 /al 51 99.8% identity in 603 aa overlap 

10 20 30 40 50 60 
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ml 51 .pep MKQIRNIAI IAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 

I I I I I i I I I I I I I I I I I I I I I I I M i I I I I I I I 1 M E I I I I I I I I I I i I I I I I I I I i M I 
al51 MKQIRNIAI IAHVDHGKTTLVDQLLRQSGTFRANQQVDERVMDSNDLEKERGITILAKNT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 5 1 . pep AIDYEGYHINI VDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 

I I I I I I I I I I I I I i I I I I I I I I I I I t I I t I I I I I I f I I I I I I I I I I I I I I I I M I I I I I t 
alSl AIDYEGYHINI VDTPGHADFGGEVERVLGMVDCWLLVDAQEGPMPQTRFVTKKALALGL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 51 .pep KPIVVINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 
M I I I I I I I I II I i I I I I I I 11 M I I I I I I I I 1 I I I I I I I I I t I I I I I I I I I I I I I I I I I 
al51 KPIWINKIDKPSARPSWVIDQTFELFDNLGATDEQLDFPIVYASGLSGFAKLEETDESN 

130 140 150 160 170 180 

190 200 210 220 230 240 

mlSl.pep DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRLGIGRILNGRIKPGQTVAVMN 
I I I I I I I I I I I I I I I I I I I I M I 1 I I I I I 1 I I 1 I I I I t II I I I I I I ! I I ! I 1 I 1 : t I I I I 
al51 DMRPLFDTILKYTPAPSGSADETLQLQISQLDYDNYTGRLGIGRILNGRIKPGQVVAVMN 

190 200 210 220 230 240 

250 260 270 280 290 300 

mlSl.pep HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 
I I I I I I I I I I I I I I 1 M I I I II I I I I I I 1 I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I 
al51 HDQQIAQGRINQLLGFKGLERVPLEEAEAGDIVIISGIEDIGIGVTITDKDNPKGLPMLS 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml51 .pep VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
al51 VDEPTLTMDFMVNTSPLAGTEGKFVTSRQIRDRLQKELLTNVALRVEDTADADVFRVSGR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 5 1 . pep GELHLTI LLENMRREGYELAVGKPRVV YRDI DGQKCEPYENLTVDVPDDNQGAVMEELGR 

I I I I I I I I I I I I I I I I I I I I II M I I I I M I I I I I I I I I I 1 I I I M I I I I I I I I 1 I I I I I 
a 1 5 1 GELHLTI LLENMRREGYELAVGKPRVV YRDI DGQKCEPYENLTVDVPDDNQGAVMEELGR 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 51 . pep RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 
I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
al51 RRGELTNMESDGNGRTRLEYHIPARGLIGFQGEFMTLTRGVGLMSHVFDDYAPVKPDMPG 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 51 . pep RHNGVLVSQEQGEAVAYALWNLEDRGRMFVSPNDKIYEGMIIGIHSRDNDLVVNPLKGKK 
I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I t 
a 1 5 1 RHNGVLVSQEQGEAVAYALWNLEDRGRMFVS PNDKI YEGMI IGIHSRDNDLVVNPLKGKK 

490 500 510 520 530 540 

550 560 570 580 590 600 

mlSl.pep LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 
I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I 
al51 LTNIRASGTDEAVRLTTPIKLTLEGAVEFIDDDELVEITPQSIRLRKRYLSELERRRHFK 

550 560 570 580 590 600 



mlSl.pep KLDX 
I I I I 

al51 KLDX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 61 9>: 

gl52 . seq 

l ATGAAAAaca aAACCaaagt ctgGGacttc cCcacccgcc ttTTCCactG 

51 GctgcttgCC gCATCCctgc CCTTTATGTG gtatagCGCA AAAGCCGGCG 

101 GcgataTGCT GcaatgGCAC ACGCGCGTCG GGCTGCTCGT CCTTTTCCTG 
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151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAgcgATA CCGCCCGTTT 

201 CTCccgTtTC GTCCGAGGTT GGGCAGGTAT ACGCGGCTAT CTGAAAAAcg 

251 gCATTCCCGA ACAtatcCAG CCCGGACACA ACCCCTTGGG CGCACTgatg 

301 gtcGTTGCGC TTTTGgccgc cgtcTCATTT CAagtcggcA CGGGGCTTTT 

3 51 Tgccgccaat gaaaacacct tcagcaCCAa cggctacctc aaccatttgg 

401 tttccgaaca tacgGGCAGC CTTATACGGA AAATCCACCT CAACTTTTTC 

451 AAGCTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGCCG TCGCCGCATA 

501 CCGCATATTC AAAAAGAAAA ACCTCGTCCG CCCGATGATA ACCGGCTTCA 

551 AATACATCGA AGGCAAAACC TCAATCCGCT TTGCCGGCAA AGCCGCGCTT 

601 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 

651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 620; ORF 152.ng>: 

gl52 .pep 

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TR VGLLVLFL 

51 LVFRLCWGI W GSDTARFSRF VRGWAG I RGY LKNGIPEHIQ PGHNP LGALM 

101 WALLAAVSF QV GTGLFAAN ENTFSTNGYL NHLVSEHTGS LIRKIHLNFF 

151 KLLAVFSAVH IAAVA AYRIF KKKNLVRPMI TGFKYI EGKT SIRFAGKAAL 

201 AAALSVAALA AAAILLL S* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 62 1>: 



ml52 . seq 












1 


ATGAAAAACA 


AAACCAAAGT 


CTGGGACCTC 


CCCACCCGCC 


TTTTCCACTG 


51 


GCTGCTTGCC 


GCGTCCCTGC 


CCTTTATGTG 


GTATAGCGCG 


AAAGCCGGCG 


101 


GCGATATGCT 


GCAATGGCAC 


ACGCGCGTCG 


GGCTGTTCGT 


CCTTTTCCTG 


151 


CTCGTATTTC 


GCCTCTGCTG 


GGGCATTTGG 


GGCAGCGATA 


CCGCCCGTTT 


201 


TTCCCGTTTC 


GTCCAAGGCT 


GGGCAGGCAT 


ACGCGGCTAT 


CTGAAAAACG 


251 


GTATTCCCGA 


ACACATCCAG 


CCCGGACACA 


ACCCCTTGGG 


CGCACTGATG 


301 


GTCGTTGCGC 


TTTTGGCCGC 


CGTGTCCTTC 


CAAGTCGGCA 


CCGGGCTTTT 


351 


TGCCGCCGAT 


GAAAACACCT 


TCAGCACCAA 


CGGCTACCTC 


AACCATTTGG 


401 


TTTCCGAACA 


TACGGGCAGC 


CTTATGCGGA 


AAATCCACCT 


CAACTTTTTC 


451 


AAGCTGCTCG 


CCGTTTTTTC 


TGCAATCCAC 


ATCGCCGCCG 


TCGCCGCATA 


501 


CCGCGTATTC 


AAAAAGAAAA 


ACCTCATCCT 


CCCGATGATA 


ACCGGCTTCA 


551 


AATACATCGA 


AGGCAAAACC 


TCAATCCGCT 


TTGCAGGCAA 


AGCCGCGCTT 


601 


GCCGCCGCAT 


TATCGGTTGC 


CTCGCTTGCC 


GCAGCCGCCA 


TCCTGCTCCT 


651 


GTCCTGA 











This corresponds to the amino acid sequence <SEQ ID 622; ORF 152>: 

ml52 .pep 

1 MKNKTKVWDL PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TR VGLFVLFL 

51 LVFRLCWGIW GSDTARFSRF VQG WAG I RGY LKNGIPEHIQ PGHNP LGALM 

101 WALLAAVSF QV GTGLFAAD ENTFSTNGYL NHLVSEHTGS LMRKI HLNFF 

151 KLLAVFSAIH IAAVA AYRVF KKKNLILPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVASLA AAAILLL S* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N.gonorrhoeae 

ORF 152 shows 95.4% identity over a 218 aa overlap with a predicted ORF (ORF 152.ng) 
from N. gonorrhoeae: 

m!52/gl52 



10 20 30 40 50 60 

ml 5 2 . pep MKNKTKVWDL PTRLFHWLLAAS LP FMWYSAKAGGDMLQWHTRVGLFVLFLLVFRLCWG I W 

I I I I I I I I I = I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I M I I I I I I I | I | | 
gl52 MKNKTKVWDFPTRLFHWLLAASLPFMWYSAKAGGDMLQWHTRVGLLVLFLLVFRLCWGIW 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml5 2 . pep GSDTARFSRFVQGWAGIRGYLKNGIPEHIQPGHNPLGALMWALLAAVSFQVGTGLFAAD 

1 1 M llllll hi 1 1 1 1 IIMMIIIIIIIIllll MMIIIMIIIMMMIIII Ih 

gl 5 2 GSDTARFSRFVRGWAGI RGYLKNGI PEH I Q PGHNPLGALMWALLAAVS FQ VGTGL FAAN 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml52 . pep ENTFSTNGYLNHLVSEHTGSLMRKIHLNFFKliIJ^VFSAIHIAAVAAYRVFKKKNLILPMI 

II M I M 1 1 Mill M M MMIMI M I IM 1 1 1 1 1 MM I II 1 1 I ! M M M M III 

gl52 ENTFSTNGYLNHLVSEHTGSLIRKIHLNFFKLLAVFSAVHIAAVAAYRIFKKKNLVRPMI 

130 140 150 160 170 180 



190 200 210 219 

ml52 .pep TG FKY I EGKTS I RFAGKAALAAALS VASLAAAA I LLLSX 

I M II I II M II M 111 1 1 MINI lM III Ml I III I 

gl52 TGFKYIEGKTS I RFAGKAALAAALS VAALAAAAI LLLSX 

190 200 210 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 623>: 

al52 . seq 

1 ATGAAAAACA AAACCAAAGT 

51 GCTGCTTGCC GCATCCCTAC 

101 GCGATATGCT GCAATGGCAC 

151 CTCGTATTCC GCCTCTGCTG 

201 CTCCCGTTTC GTCCGCGGAT 

251 GTATTCCCGA ACACGTCCAA 

301 GTCGTTGCGC TTTTGGCCGC 

351 TGCCGCCGAT GTAAACACCT 

4 01 TTTCCGAACA TACGGGCAGC 

4 51 AAACTGCTCG CCGTTTTTTC 

501 CCGCGTGTTC AAAAAGAAAA 

551 AATACATCGA AGGCAAAACC 

601 GCCGCCGCAT TATCGGTTGC 

651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 624; ORF 152.a>: 

al52 .pep 

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KTGGDMLQWH TR VGLFILFL 

51 LVFRLCWGIW GSDTARFSRF VRGWSGIREY MKNGIPEHVQ PGHNPLGALM 

101 VVALLAAVSF QV GTGLFAAD VNTFSTNGYL NHLVSEHTGS LMRKIHLNFF 

151 KLLAVFSAVH IAXVA AYRVF KKKNLVLPMI TGFKYIEGKT SIRFAGKAAL 

201 AAAL S V AAL A AAAILL LS* 

ml52/al52 94.0% identity in 218 aa overlap 

10 20 30 40 50 60 

ml52 . pep MKNKTKVWDLPTRLFHWLLAASLPFMWYSAKAGGDMLQWHTRVGLFVLFLLVFRLCWGIW 
I I I I M I I I : I I I I I I I M 1 I I I II i It 1 11 : I I I I I I I I i I I I I i : i II M I I I 1 I I M 
al52 MKNKTKVWDFPTRLFHWLLAASLPFMWYSAKTGGDMLQWHTRVGLFILFLLVFRLCWGIW 

10 20 30 40 50 60 



CTGGGACTTC 
CCTTTATGTG 
ACGCGCGTCG 
GGGCATTTGG 
GGTCGGGTAT 
CCCGGACACA 
CGTGTCGTTC 
TCAGCACCAA 
CTTATGCGGA 
CGCAGTCCAC 
ACCTCGTCCT 
TCAATCCGCT 
CGCGCTTGCC 



CCCACCCGCC 
GTATAGCGCG 
GGCTGTTTAT 
GGCAGCGATA 
CAGAGAGTAT 
ACCCCTTGGG 
CAAGTCGGCA 
CGGCTACCTC 
AAATCCATCT 
ATCGCCGNCG 
CCCGATGATA 
TTGCCGGCAA 
GCAGCCGCCA 



TTTTCCACTG 
AAAACCGGCG 
CCTTTTCCTG 
CCGCCCGTTT 
AT GAAAAACG 
CGCACTGATG 
CAGGGCTTTT 
AACCATTTGG 
CAACTTTTTC 
TCGCCGCATA 
ACCGGCTTCA 
AGCCGCGCTT 
TCCTGCTCCT 



70 80 90 100 110 120 

ml52 . pep GSDTARFSRFVQGWAGIRGYLKNGIPEHIQPGHNPLGALMWALLAAVSFQVGTGLFAAD 
M I I I I I I I I I : I I : I I I I : I I II I I I : I I I I I I I I I I I I ! I M f I I I I I I I M I I I M 
al52 GSDTARFSRFVRGWSGIREYMKNGIPEHVQPGHNPLGALMWALLAAVSFQVGTGLFAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml52 . pep ENTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAIHIAAVAAYRVFKKKNLILPMI 
(I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I II : 1 I 1 111111111111:1111 
al52 VNTFSTNGYLNHLVSEHTGSLMRKIHLNFFKLLAVFSAVHIAXVAAYRVFKKKNLVLPMI 

130 140 150 160 170 180 



190 200 210 219 

m!52 .pep TG FKY I EGKT S I RFAGKAALAAALS VAS LAAAAI LLLSX 

I I I I I I I I I I I I I I I I I I I I I I I I 11 I : 1 1 I I I I 1 I I I I 
al52 TGFKYIEGKT SI RFAGKAALAAALS VAALAAAAI LLLSX 

190 200 210 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 625>: 

gl53.seq 

1 atggggtttg cttaCAgtat gacgtatatc gaggtCGGGa taccggaggc 

51 ggcatccgtc ctttCgctGC CCGAGATgat gcgcctgatG GTGTTtCagg 

101 at t ATGGTTT TttggcCGAA GTGATGTTTG TGctgaCTTT cGGCGcgcCG 

151 GTTCTGTTtC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTG CGCTTGA 

251 GGCAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTCT GGTGG CGTAT 

301 ATCAAGCTCT CGTCTGTGGC AAAGGTTCGC TTCGGGCCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCGGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

401 AGCATTGGGT GTATTTCCAA ATCGGGCGGC TGACGGGGAA TAATGCGGTT 

451 C AG ACGG CAT CGGAAGGCAA AACCTGTTGC AGCCGCTGCC TGTATTTccg 

501 cgacAGTgcc gaatccCCCT GCGGGGTGTg cgGCGcggaA CTgtacggcg 

551 gacggccgaa aagtCTGAGt atttCgtCGG CGTTTCTgac ggcggcggTT 

601 GTTTTGTATT TCCctgCcaa TATCctgccg attaTGAttt cgtccAATCc 

651 tgccgccacg GAGGcCAACA CCAT CTTT AG CGGCATCGCT TATATGTGGG 

701 ACGagggcgA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

751 GTGCCGGTGC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGGCGGCACG 

801 GTTCGCTTTG CCGGCGGGCG CAAAGAAATT GTCGCACCTC tacCGCATCA 

851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT 

901 TTGATGTGTT CGTTCCacaC TTATG CCGCG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTTT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCG CTTCAG A CGGCATTGCT 

1051 TTCAACGAAA CGGAAAAATA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 626; ORF 153.ng>: 

gl53 .pep 

1 MGFAYSMTYI EVGIPEAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

5 1 VLFLLLCLYV YAALI RKCAY PALRLATRVM VRLRQ AMMVD VFFVSTLVAY 

101 IKL SSVAKVR FGP AFYLMFA LSVMLIRTSV SVPQHWVYFQ I GRLTGNNAV 

151 QTAS EGKTCC SRCLYFRDSA ESPCGVCGAE LYGGRPKSLS I S S AFLTAAV 

2 01 VLYFPANILP I MISSNPAAT EANTIFSGIA YMWDEGDRLI AAVIFSASIL 
251 VPVLKIAAMS VLIAAA RFAL PAGAKKLSHL YR I TEA VGRW SMIDIFVIII 
301 LMC S FHT YAA R VIPGSAAVY FCLWILTM L SAYYFDPRLL WDKRASDG I A 
351 FNETEKYD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 627>: 

ml53 . seq 

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATTATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACTTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

251 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGG CGTAT 

3 01 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGGCCGG CGTTTTATCT 
351 GATGTTCGCG CTGTCAGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

4 01 AGCATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 
4 51 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 
501 CGACAGTGCC GAATCCCCCT GCGG CGTGTG CGGTGCGGAA CTGTACCGCC 
551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 
601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG ATTATGATTT CGTCCAATCC 
651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG 
701 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 
751 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 
801 CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 
851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT 
901 TTGATGTGTT CGTTCCACAC TTATG C CGCG CGCGTCATTC CGGGCAGTGC 
951 GGCAGTCTAT TTCTGCCTGG TCGTGATTCT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC G CG CTTCAG A CGGCATTGCT 

1051 TTCAATGAAA CGGAAAAACA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 628; ORF 153>: 

ml53 .pep 

1 MAF AYGMTY I EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

51 VLFLLLCLYV YAALI RKQAY PALRLATRVM VRLRQ AMMVD VFFVSTLVAY 
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101 IKL SSVAEVR FGP AFYLMFA LSVMLIRTSV SVPQHWVYFQ IGRLTGDNAV 

151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYRRRPKSLS ISS AFLTAAV 

2 01 ILYFPANILP I MISSNPAAT EVNTIIiNGIA YMWDEGDRLI AAVI FSAS IL 
251 VPVLKIAAMS VLIAS ARFAL PTGAKKLSHL YRITE AVGRW SMIDIFVIII 

3 01 LMC SFHTYAA R VIPGSAAVY FCLWILT ML SAYYFDPRLL WDKRASDGIA 
3 51 FNETEKHD* 

ml53 / gl53 96.1% identity in 358 aa overlap 

10 20 30 40 50 60 

ml53 . pep MAF AYGMT Y I E VG I PG AAS VLS LPEMMRLMVFQD YG F LAEVM F VLT FG AP VLF LLLCLYV 

hllhlllllilll Mil IMIIIII IIMMIlll!! ,1 III !l: IIIM III Mil 

g X 53 MGFAYSMTYIEVGI PEAASVLSLPEMMRIJWFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 5 3 pep YAAL I RKQA YPALRI^TR VMVRLRQAMMVD VFFVSTLVAY I KLS SVAEVRFG P AFYLMFA 

II 1 li I I'MI III III lllllilllIMM! IMIIIIMMII IhlllM Mill It 

gl 5 3 YAAXi I RKQAYPALRLATRVMVRLRQAMMVDVFF VS TLVAYI KLS S VAKVRFG P AFYLMFA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 5 3 pep LS VMLI RTS VS V PQHWVY FQ I GRLTGDN AVQT AS EGKTCCSRC L YFRD S AE S PCGVCGAE 

I I I I I I f I I ] I I 1 I I 1 1 I t 1 t I J I I I ^ J I I f I t 1 I I I 1 1 I 1 I 1 1 1 t I I I 1 1 I 1 1 I 1 1 I I t 
g i53 LSVMLIRTSVSVPQHWVYFQIGRLTGNNAVQTASEGKTCCSRCLYFRDSAES PCGVCGAE 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml53 . pep LYRRRPKSLS I SSAFLTAAVILYFPANILPIMISSNPAATEVNT I LNGIAYMWDEGDRLI 

II 1 1 1 1 1 1 1 1 1 1 1 1 II I I = I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M I i h I II 1 1 1 1 1 M 1 1 1 

gl53 LYGGRPKSLSISSAFLTAAWLYFPANILPIMISSNPAATEANTIFSGIAYMWDEGDRLI 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml53 . pep AAVI FSAS ILVPVLKI AAMSVLIASARFALPTGAKKLSHLYRI TEAVGRWSMIDIFVI 1 1 

MM : MMM I il MMMM IMMI MiMIII MUM IHIMM IMMIMM 

gl 5 3 AAVI FSAS I LVPVLKIAAMSVLI AAARFALPAGAKKLSHLYRITEAVGRWSMIDIFVI 1 1 

250 260 270 280 290 300 

310 320 330 340 350 359 

ml 5 3 . pep LMCSFHTYAARVI PGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGI AFNETEKHDX 

IIMIllllllllllllllllllllllllMMIIIIIIIIIIIIIIMIIIIIIhll 

gl53 IMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKYDX 
310 320 330 340 350 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 629>: 

al53 .seq 

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATTATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACCTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

2 51 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGGCGTAT 

301 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGATCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCGGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

4 01 AGCATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 

4 51 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 

501 CGACAGTGCC GAATCCCCCT GCGGCGTGTG CGGTGCGGAA CTGTACCGCC 

551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 

601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG AT T AT G ATT T CGTCCAATCC 

651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG 

7 01 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

7 51 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 

801 CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 
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851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT 

901 TTGATGTGTT CGTTCCACAC TTATGCCGCG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTCT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TTCAATGAAA CGGAAAAACA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 630; ORF 153.a>: 

al53.pep 

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

51 VLFLLLC LYV YAALIRKQAY PALRLATRVM VRLRQA MMVD VFFVSTLVAY 

101 IKLSSVAEVR FG S A FY LM FA LSVMLIRTSV SVPQHWVYFQ IGRLTGDNAV 

151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYRRRPKSLS ISSAFLTAAV 

201 ILYFPANILP I MISSNPAAT EVNTILNGIA YMWDEGDRL I AAVIFSASIL 

251 VPVLKIAAMS VLIASARFAL PTGAKKLSHL YRITE AVGRW SMIDIFVIII 

301 LMC SFHTYAA R VIPGSAAVY FCLWILT ML SAYYFDPRLL WDKRASDGIA 

351 FNETEKHD* 

ml53/al53 99.7% identity in 358 aa overlap 

10 20 30 40 50 60 

MAFAYGMTYI EVG I PGAASVLSLPEMMRLMVFQDYGFLAEVMFVLT FGAPVLFLLLCLYV 
I I I M I I I I I I i I t I I I I M I I t M I I I I M I M I M I I I I I I I I It I I M I I I I I It I I 
MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 
10 20 30 40 50 60 

70 80 90 100 110 120 

Y AAL I RK Q A Y P ALR LAT R VMVRLRQAMM V D V F FV S T L VA Y I KL S S VAE VR FG PA F Y LM FA 
I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M 1 I I I I I I I I I I I I I I I I I I I I I I I I I 
YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGSAFYLMFA 
70 80 90 100 110 120 

130 140 150 160 170 180 

LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 

I I II I I I I I I I I II I I I I II I I I I I I I M I I I I I I I I I I I I I II I I I I II I I I I I M I I I 
LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 
130 140 150 160 170 180 

190 200 210 220 230 240 

LYRRRPKSLSISSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI 
I I II I I I I I I I I I I I I I I ! I I I I I I I I! I I II I I I ! M I I I II I I I I I I I I I I I I I M I I 
LYRRRPKSLS ISSAFLTAAV I LYFPAN I LP IMISSNPAATEVNT I LNGIAYMWDEGDRL I 
190 200 210 220 230 240 

250 260 270 280 290 300 

AAVIFSASILVPVLKIAAMSVLIASARFALPTGAKKLSHLYRITEAVGRWSMIDIFVIII 
I I I M I I I I I I I I I I I I II I I 1 I I I I I I I I I II I I I I 1 I I I I I I I M I I I II I I I I I I II 
AAVI FS AS ILVPVLKI AAMS VLI AS ARFALPTGAKKLSHLYRITEAVGRWSMI DI FVI I I 
250 260 270 280 290 300 

310 320 330 340 350 359 

LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 
I I I I I I I I II I 11 I I I I I I I I I I I I I I I I I I I I I I I I I II II M I II I I I I I I I I I I I I 
LMCSFHTYAARVIPGSAAVYFCLVVILTMLSAYYFDPRLLWDKRAS DGIAFNETEKHDX 
310 320 330 340 350 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 63 1>: 

gl54 -seq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCTCAAG CACGCGTCCG 

51 CAAAAACAAC accttCCtCT CCGCCGTCTG GCTGGTCCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCG CGGAA GGCATCGAAG TCAACAATAC 

201 GGTCATTAAG GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTTA CTGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGcgtAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 



ml53 .pep 
al53 

ml53 .pep 
al53 

ml 53 .pep 
a!53 

ml53.pep 
al53 

ml53 .pep 
al53 

ml 5 3 . pep 
al53 
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401 ACATCGCTTT TACACCCGGC AAAAGCGGCG 

4 51 GTGCAGGACA TTCCGCCCGT TACCGCCATC 

501 GAATTTGATT GGTAAAAACG AccgCATCCT 

551 TG T ATGAAAA CTTTATGGTC GGGCAAATCG 

601 TCCGACCAAA GCGTGCATTA CACCATCTTC 

651 ACTGATTCAT TCCGCCAGCC GTTTTTGGCT 

701 AAACCACAGG CAGCGG CATC AAACTCAATT 

751 CTGTCAGGCG CGATTTCATT TGATTCGCCG 

801 CGTCAAAAGC GAGG AC AG CT TCACGCTTTA 

851 CCAACCTGCC TGACGACCGC TCGCTGTACT 

901 TCCGTGCGCG GACTGACCGT cggTTCGCCT 

951 TGTcggCATG GTTTCCGATG TCCCTTATTT 

1001 ACCtgtTTGA aaacggctgg aTTcccGtac 

1051 CGTTTGGAAA TCAATGCCGA CGAGCAAAGC 

1101 ATTCCAGACG GCCTTAAACA AAGGCCTGAC 

1151 ACCTGCTGAC CGGCGGCAAA ATGATTGAGT 

1201 TCGCCCAAGC TGCGACCGCA TACCGTTTAT 

1251 CACACGGGGC GGCGGTTTGG ATGACTTGCA 

1301 TGGACaaatT CAACAATCTG CCATTggata 

1351 GGCTCGCTCG CCGAACTCAA GTCCGCACTC 

14 01 AAGCTCCATT GacaAACTGG TCGgcaaTCC 

14 51 ACGAACTGAA CCAAACTCTG AAAGAGTTGC 

1501 TCGcctCAAT CGCCTATCT a cgGAgacgta 

1551 GGACAAAACC TTAAAagacg TtcaACCCGT 

1601 aacCCaaCgc actGATTTtc aacaACAGCA 

16 51 GGAAGCCGAT AA 

This corresponds to the amino acid sequence <SEQ ID 632; ORF 154.ng>: 

gl54 .pep 

1 MTDNSPPPNG HAQARVRKNN T FLSAVWLVP LIALIAGGW L WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSGEAKDVFQ 

151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQIESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPltPAL 

251 LSGAISFDSP KTKNSKNVKS EDS FTLYDS R SEIANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGM VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 

3 51 RLEINADEQS KEHWKQQFQT AUCJKG LT AT I SSNNLLTGGK MIELNDQPSA 
401 SPKLRPHTVY AGDTVI ATRG GGLDDLQVKL ADLLDKFNNL PLDKTVAELN 

4 51 GSLAELKSAL KSANAALSSI DKLVGNPQTQ KTI PNELNQTL KELRITLQGV 
501 SPQSPIYGDV QNTXjQSLDKT LKDVQPVINT LKEKPNALIF NNSSKDPIPK 
551 GSR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 633>: 

ml 5 4 . seq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCCCAAG CACGCGTCCG 

51 CAAAAACAAC ACCTTCCTCT CTGCCGTCTG GCTGGTTCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATTGAGG TCAACAATAC 

2 01 GGTCATCAAA GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 
251 TGCGCGACGA CCAAAAAGGC GTGGAAGTAA CCGCCCAACT CAATGCGGAC 

3 01 GTATCCGGCC T CAT CCGC AG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 
351 TAT CGACCAA AGCGGCGTAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 

4 01 ACATCG C CTT TACACCCGGC AAAAGCGACG AGGCAAAAGA CGTGTTCCAA 
451 GTGCAGGACA TTCCGCCCGT TACCGCCATC GGGCAAAGCG GGCTGCGCTT 
501 GAATTTGATT GGTAAAAACG ACCGCATCCT CAACGTCAAC AGCCCTGTTT 
551 TG T ATGAAAA TTTTATGGTC GGGCAAGTCG AAAGCGCGCA TTTCGACCCG 
6 01 TCCGACCAAA GCGTGCATTA CACCATCTTC ATCCAAAGCC C C AACGACAA 
651 ACTGATTCAT TCCGCCAGCC GTTTTTGGCT GGAAAGCGGC AT CAATATCG 
701 AAACCACAGG CAGCGG CATC AAACTCAATT CCGCCCCTCT GCCTGCCCTG 
751 CTGTCGGGCG CGATTTCATT TGATTCGCCG AAAACCAAAA ACAGTAAAAA 
801 CGTCAAAAGC GAAGACAGCT TCACGCTTTA CGACAGCCGC AGCGAAGTCG 
851 CCAACCTGCC TGACGACCGC TCGCTGTACT ACACCGCGTT TTTCAAACAA 
901 TCCGTGCGCG GCCTGACCGT CGGTTCGCCC GTCGAGTACA AAGGGCTGAA 
951 TGTCGGCGTG GTTTCCGACG TTCCTTATTT CGACCGCAAC GACAG CCTGC 

1001 ACCTGTTTGA AAACGGCTGG ATACCCGTAC GCATCCGCAT TGAACCTTCC 

1051 CGTTTGGAAA TCAATGCCGA CGAACAAAGC AAAGAACATT GGAAACAACA 

1101 ATTTCAGACG GCCTTAAACA AAGGCCTGAC CGCCACCATC TCCAGCAACA 

1151 ACCTGCTGAC CGGAAGCAAA ATGATTGAGT TGAACGATCA GCCTTCCGCA 



AGGCAAAAGA 
GGGCAAAgcg 
CAACGTcaaC 
AAAGCGCGCA 
ATCCAAAGCC 
GGAAAGCGGC 
CCGCCCCTCT 
AAAACCAAAA 
CGACAGCCGC 
ACACCGCGTT 
GTcg aATACA 
TGACCGCAAt 
gCATCCGCAT 
AAAGAGCATT 
CGCCACCATC 
TGAACGATCA 
GCAGGCGATA 
GGTCAAATTG 
aAACCGTTGC 
AAATCCGCCA 
GCAGACGCAA 
GCATAACC CT 
caAAATAcgc 
CATTAACACT 
GCAAAGACCC 



CGTGTTCCAA 
GGCTGCGCTT 
AGCCCTGTTT 
TTTCGACCCG 
CCAACGACAA 
ATCAATATCG 
GCCTGCCCTG 
ACAGTAAAAA 
AG CGAAATCG 
TTTCAAACAA 
AAGGGCtgaA 
gacagCCTGC 
cgagccTTCC 
GGAAACAACA 
TCCAGCAACA 
GCCTTCCGCC 
CCGTCATCGC 
GCGGATTTGC 
CGAATTGAAC 
ATGCCGCCCT 
AACATCCCGA 
GCAAGGCGTA 
tgCaAAGTTT 
TTGAaAGAAa 
tATCCCGAAA 



WO 99/57280 



PCT/US99/09346 



441 



1201 TCACCTAAGC TGCGACCGCA TACCGTTTAT GCAGGCGATA CCGTTATCGC 

1251 GACCCAGGGC GGCGGTTTGG ACGATTTGCA GGTCAAATTG GCGGATTTGC 

1301 TGGACAAGTT CGACAAACTG CCTTTAGATA AG ACGGTTG C CGAATTGAAC 

1351 GGTTCGCTTG CCGAGCTCAA ATCCACACTC AAATCTG C C A ATGCCGCCCT 

1401 AAGCTCCATC GACAAACTGG TCGGCAAACC GCAGACACAA AACATTCCGA 

1451 ACGAACTGAA CCAAACCCTG AAAGAGTTGC GCACAACCCT GCAAGGCGTA 

1501 TCGCCGCAAT CG CCT ATCT A CGGCGACGTA CAAAATACGC TG C AAAGTTT 

1551 GGACAAAACT TTAAAAGACG TTCAACCCGT GATTAATACT TTGAAAGAAA 

1601 AACCCAACGC GCTGATTTTC AACAGCAGCA GCAAAGACCC TATCCCGAAA 

1651 GGAAGCCGAT AA 



This corresponds to the amino acid sequence <SEQ ID 634 ORF 154.a>: 

ml 5 4 .pep 

1 MTDNSPPPNG HAQARVRKNN T FLSAVWLVP LIALIAGG WL WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSDEAKDVFQ 

151 VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQVESAHFDP 

201 SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL 

251 LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVANLPDDR SLYYTAFFKQ 

301 SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 

351 RliEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA 

401 SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN 

451 GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV 

501 SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK 

551 GSR* 



ml54 / gl54 97.8% identity in 553 aa overlap 



10 20 30 40 50 60 

ml54 .pep MTDNSPPPNGHAQARVRKNNTFLSAWLVPLIALIAGGWLWVKEIRNRGPVVTLIjMDSAE 

I IMMM MIMI II IM III IM III M I II I MINI I II M Ml II I II I I II I II 

gl54 mDNSPPPNGHAQARV^KNNTFLSAWriVPLIALIAGGWLWVKEIRNRGPVVTLIiMDSAE 
10 20 30 40 50 60 



70 80 90 100 110 120 

m!54 - pep G I EVNNTVI KVLS I DVGRVTRI KLRDDQKG VEVTAQLNAD VSGLIRSDTQFWVVKPRIDQ 

llllillllll IIIIMMIIIMI MINIMUM! Hill IIIIMM I I I MINI 

gl 54 G I EVNNTVI KVLS I DVGRVTRI KLRDDQKGVEVTAQLNADVSGLIRSDTQFWWKPRIDQ 

70 80 90 100 110 120 



13 0 14 0 15 0 16 0 170 180 

ml54 .pep S G VTGLGTLLSGS Y I AFT PG KSD EAKD VFQVQD I P P VT AI GQS G LRLN LI G KNDR I LNVN 

MMIMMIMM MIIIMI M 1 1 ! I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II I II 1 1 1 1 1 1 1 1 

g!54 SGVTGLGTLLSGSYIAFTPGKSGEAKDVFQVQD I PPVTAIGQSGLRLNLIGKNDRILNVN 

130 140 150 160 170 180 



190 200 210 220 230 240 

ral54 -pep S P VL YENFMVGQVE S AHFD PSDQS VHYT I F I QS PND KL I H S AS RFWLESG I NI ETTGSG I 
1 1 i I 1 1 1 M 1 1 i : I I I I I I I i I II ! M I M I I M 1 1 1 II I 1 1 I I I 1 1 1 I II 

gl54 S P VL YENFMVGQ I ES AHFD P SDQS VHYT I F I QS PNDKL I HS AS RFWLE SG I N I ETTG SGI 

190 200 210 220 230 240 



250 260 2 70 280 2 90 300 

ml 54 .pep KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 I M II IMM M M : I MM I 1 1 I 1 M 1 1 I 1 1 

gl54 KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEIANLPDDRSLYYTAFFKQ 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml 54 .pep SVRGLTVGS PVEYKGLNVGWSDVPYFDRNDSLHLFENGWI PVRI RIEPSRLE INADEQS 

MM IM MM M I IMM Ml I I MM I I I I I I MM I II M MM I I I I II II II Ml 

9154 SVRGLTVGS PVEYKGLNVGMVSDVPYFDRNDSLHLFENGWI PVRI RIEPSRLE INADEQS 

310 320 330 340 350 360 

370 380 390 400 410 420 

II1154 .pep KEHWKQQFQT ALNKGLTAT I SSNNLLTGSKM I E LNDQPSAS PKLRPHTVYAGDTVI ATQG 

IMM llllll I! Mill Mill IMIMM M MM I II I I I Mill I II I IMM I M 

g!54 KEHWKQQFQTALNKGLTAT I SSNNLLTGGKMIELNDQPSAS PKLRPHTVYAGDTVI AT RG 
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370 



380 



390 



400 



410 



420 



430 440 450 460 470 480 

ml54 . pep GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKSANAALSSIDKLVGKPQTQ 

llllllllllllllll|::|||||||lll!MlllllhMlllilllllllllhllll 
gl54 GGLD D LQVKIiADLIjD KFNN L PLDKT VAE LNGS LAE LKS ALKS AN AALS S I DKIjVGN PQTQ 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml54 . pep NIPMELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 

Illlllllllllll II IMMI II II IIIMI Mill lllll IIIMIII IIIIIMM 

gl54 NI PNE LNQTLKE LR I TLQG VS P Q S P I YGD VQNTLQS LD KTLKD VQ P VI NTLKEKPNAL I F 

490 500 510 520 530 540 



inl54 .pep 
gl54 



550 

NSSSKDPIPKGSRX 

hllllMIIIIM 

NNSSKDPIPKGSRX 
550 



The following partial DNA sequence was 

al54 . seq 

ATGACTGACA ACAGCCCTCC 
CAAAAACAAC ACCTTCCTCT 
TGATTGCCGG CGGCTGGCTT 
GTGGTTACGC TCTTGATGGA 
GGTCATCAAA GTATTGAGCA 
TGCGCGACGA CCAAAAAGGC 
GTATCCGGCC TCATCCGCAG 
T AT CG AC C AA AGCGGCGTAA 
ACATCGCCTT TACACCCGGC 
GTGCAGGACA TTCCGCCCGT 
GAATTTGATT GGTAAAAACG 
TGTATGAAAA CTTTATGGTC 
TCCGACCAAA GCGTGCATTA 
ACTGATTCAT TCCGCCAGCC 
AAAC C AC AG G C AG CG G CAT C 
CTGTCGGGCG CGATTTCATT 
CGTCAAAAGC GAAGACAGCT 
CCAACCTGCC TGATGACCGT 
TCCGTGCGCG GACTGACCGT 
TGTCGGCGTG GTTTCCGATG 
ACCTGTTTGA AAACGGCTGG 
CGTTTGGAAA TCAATGCCGA 
ATTTCAGACG GCCTTAAACA 
ACCTGCTGAC CGGCAGCAAA 
TCGCCCAAGC TGCGACCGCA 
GACCCAGGGC GGCGGTTTGG 
TGGACAAGTT CGACAAACTG 
GGTTCGCTTG CCGAGCTCAA 
AAGCTCCATC GACAAACTGG 
AC G AACT G AA CCAAACCCTG 
TCGCCTCAAT CGCCTATCTA 
GGACAAAACC TTAAAAGACG 
AACCCAACGC GCTGATTTTC 
GGAAGCCGAT AA 



identified in N. meningitidis <SEQ ID 635>: 



l 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 



TCCAAACGGA 
CTGCCGTCTG 
TGGGTTAAGG 
CAGCGCGGAA 
TCGATGTCGG 
GTGGAAGTAA 
CGATACCCAG 
CCGGTTTGGG 
AAAAGCGACG 
TACCGCCATC 
ACCGCATCCT 
GGGCAAGTCG 
CACCATCTTC 
GTTTCTGGCT 
AAACTCAATT 
TGATTCGCCG 
TCACGCTTTA 
TCGCTGTACT 
CGGTTCGCCT 
TTCCTTATTT 
ATTCCCGTAC 
CGAACAAAGC 
AAGGCCTGAC 
AT GAT T GAG T 
TACCGTTTAT 
ACGATTTGCA 
CCTTTAGATA 
ATCCACACTC 
TCGGCAAACC 
AAAGAGTTGC 
CGGCGACGTA 
TTCAACCCGT 
AACAGCAGCA 



CACGCCCAAG 
GCTGGTTCCG 
AAATCCGCAA 
GGCATTGAGG 
ACGCGTTACC 
CCGCCCAACT 
TTTTGGGTGG 
TACGCTGCTT 
AGGCAAAAGA 
GGGCAAAGCG 
CAACGTCAAC 
AAAGCGCGCA 
ATCCAAAGCC 
GGAAAGCGGC 
CCGCCCCTCT 
AAAACCAAAA 
CGACAGCCGC 
ACACCGCGTT 
GTCGAGTACA 
CGACCGCAAC 
GCATCCGTAT 
AAAGAACATT 
CGCCACCATC 
TGAACGATCA 
GCAGGCGATA 
GGTCAAATTG 
AGACGGTTGC 
AAATCTGCCA 
GCAGACACAA 
GCACAACCCT 
CAAAATACGC 
CATTAACACT 
GCAAAGACCC 



CACGCGTCCG 
CTGATCGCGC 
CAGGGGGCCT 
T C AACAAT AC 
CGAATCAAAC 
CAATGCGGAC 
TCAAGCCGCG 
TCGGGTTCGT 
CGTGTTCCAA 
GGCTGCGCTT 
AGCCCTGTTT 
TTTCGACCCG 
CCAACGACAA 
ATCAATATCG 
GCCTGCCCTG 
ACAGTAAAAA 
AGCGAAGTCG 
TTTCAAACAA 
AAGGGCTGAA 
GACAGCCTGC 
TGAGCCTTCC 
GGAAACAACA 
TCCAGCAACA 
GCCTTCCGCC 
CCGTTATCGC 
GCGGATTTGC 
CGAATTGAAC 
ATGCCGCCCT 
AACATTCCGA 
GCAAGGCGTA 
TGCAAAGTTT 
TTGAAAGAAA 
TATCCCGAAA 



This corresponds to the amino acid sequence <SEQ ID 636; ORF 154.a>: 

al54 .pep 

1 MTDNSPPPNG HAQARVRKNN T FLSAVWLVP LIALIAGG WL WVKEIRNRGP 

51 WTLLMDSAE GIEVNNTVIK VLSIDVGRVT RIKLRDDQKG VEVTAQLNAD 

101 VSGLIRSDTQ FWWKPRIDQ SGVTGLGTLL SGSYIAFTPG KSDEAKDVFQ 
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151 
201 
251 
301 
351 
401 
451 
501 
551 

ml 54/al 54 1 00.0% identity in 553 aa overlap 

10 20 30 40 50 60 

ml 5 4 . pep MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPVVTLLMDSAE 
I I I I I I ! I I M I I I I I I I I I I I I I II I I I I I M I I M I I I i II I I I I I I I I I M M M I I 
al54 MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWKEIRNRGPVVTLLMDSAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml54 .pep GIEVNNTVIKVLS I DVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWWKPRI DQ 

M I I I I I I I I I I I I I I I i I I I I I I I I I M I I I I I 1 I I I I I I I I ! I I M t II I I i I I II M 
al54 GIEVNNTVIKVLS I DVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWWKPRIDQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml54 .pep SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 

I I I I M I I I I I M I I M I I I I I I I I I I I I I I M I I I I I I I II I I M I I I I II I I I I 1 M I 
a 154 SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml54 .pep SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPNDKLIHSASRFWLESGINIETTGSGI 
I | | I I I I II I I I I I I I I M II II I I I ! I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I I 
al54 SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQS PNDKLIHSASRFWLESGINIETTGSGI 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml54 . pep KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 
I I I I I I I I I 1 I I M 1 I I I I I I I I I I I M I I 11 I i I I I I I I I 1 I I I I M I I I I I I I I I I I I 
al54 KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml54 . pep SVRGLTVGSPVEYKGLNVGWSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS 
I I I I I I i I I I M I I I I I I I I M I I I 1 I I I I I I I I I M I M I I I II I I I I I I I I II I I I I I 
a!54 SVRGLTVGSPVEYKGLNVGWSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml54 . pep KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 
I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I M I II I I I I I I I I I 
al54 KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml54 . pep GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKSANAALS SI DKLVGKPQTQ 

I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I M I I I! I I I I M I I I II I I I I I I I M I 
a 1 5 4 GGLDDLQVKLADLLDKFDKLPLDKTVAELNG S LAELKSTLKS ANAALS S I DKLVGKPQTQ 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml54 . pep NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 
I I I I I I I I M I I I I I I I I I I I I II I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M 
al54 NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 

490 500 510 520 530 540 



VQDIPPVTAI GQSGLRLNLI GKNDRILNVN SPVLYENFMV GQVESAHFDP 
SDQSVHYTIF IQSPNDKLIH SASRFWLESG INIETTGSGI KLNSAPLPAL 
LSGAISFDSP KTKNSKNVKS EDSFTLYDSR SEVAN LP DDR SLYYTAFFKQ 
SVRGLTVGSP VEYKGLNVGV VSDVPYFDRN DSLHLFENGW IPVRIRIEPS 
RLEINADEQS KEHWKQQFQT ALNKGLTATI SSNNLLTGSK MIELNDQPSA 
SPKLRPHTVY AGDTVIATQG GGLDDLQVKL ADLLDKFDKL PLDKTVAELN 
GSLAELKSTL KSANAALSSI DKLVGKPQTQ NIPNELNQTL KELRTTLQGV 
SPQSPIYGDV QNTLQSLDKT LKDVQPVINT LKEKPNALIF NSSSKDPIPK 
GSR* 
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550 

m!54 .pep NSSSKDPIPKGSRX 
II I I I I ! I I I I I I I 
al54 NSSSKDPIPKGSRX 

550 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 637>: 

gl55 . seq 

1 atGAAaatcg GtatcCCACG CGAGTCAtta tcCGGCGAAA cccgcgtagc 

51 ctgcAcgccc gCCACCGTTG CCctgctggg caAactAGGC TTTGAAACCG 

101 TTGtcgaAAG CGGTGCAggt TTGGCGGCAA GTTTggaCGA TGCCGCTTAC 

151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGCCTGCCC 

201 TTTAATTTAT AAGGTCAACG CGCCGTCCGA AGGCGAGCTG CCGCTGCTCA 

251 AAGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 

301 TTGGTCGAGG CCTTGCGCGC CAAGAAAGTC AACGCGCTGG CG AT GG AC AT 

351 GGTTCCCCGC ATTTCCCGCG CTCAGGCCTT GGACGCTTTG TCTTCAATGG 

4 01 CAAACATCAG CGGCTACCGC GCCGTGATTG AAGCCGCCAA CGCCTTCGGC 

4 51 CGTTTCTTCA CCGGTCAAAT CACTGCCGCC GGCAAAGTGC CGCCTGCGCA 

501 GGTTTTGGTG ATTGGCGCCG GTGTGGCGGG TTTGGCGGCA ATCGGTACGG 

551 CAAATTCGCT CGGCGCAGTG GTGCGCGCGT TCGATACCCG CTTGGAAGTG 

601 GCGGAACAAA TCGAATCGAT GGGCGGTAAG TTcctGAAAC TCGACTTCCT 

651 GCAAGAATCG GGCGGCAGCG GAGACGgctA CGCCAAAGTG ATGAGCGACG 

7 01 AATTTATCGC CGCCGAAATG AAGCTCTTTG CCGAACAGGC GAAAGAAGTG 

7 51 GACATCATCA TCACCACCGC CGCCATTCCG GGCAAACCCG CTCCCAAGCT 

801 GATTACCAAA GAAATGGTGG AAAGCATGAA ATCCGGATCC GTCATCGTCG 

851 ATTTGGCGGC GACGGGCGGC AACTGCGAAC TCACCCGACC GGGCGAATTG 

901 TCCGTAACCG GCAACGGCGT G AAAAT CAT C GGCTACACCG ACATGGCAAA 

951 CCGCCTTGCC GGACAGTCTT CCCAGCTTTA CGCCACCAAC TTGGTGAACC 

1001 TGACCAAGCT GTTAAGCCCG AACAAAGAcg gcgaAATCAC GCTGGACTTC 

1051 GAAGacgtGA TTATCCGCAA TATGACCGTT ACCCGcgacg gcgaaATCAC 

1101 CTTCCCGCCT CCGccgaTTc aggtTTCcgc ccggccgCAG CAAAcgccgt 

1151 ctgaAAAagc cgcGCCTGCC GCCAagcccg AgccGaaacc tgttCCcctg 

12 01 tggaAAAaac tcgCGCCCGC CGCcatcgCC GCCGTATTGG tgctgtgGgt 

1251 cggCgcggtc gcacccgcag CATTCTTGAA CCACTTTATC GTCTTCGTCC 

1301 TCGCCTGCGT CATCGGCTAC CATGTCGTTT GgaacgTCAG CCACTCGCTG 

1351 CACACACCGC TGAtgtcggt aaccaaCgcc atctccGGCA tcatggtcgt 

14 01 cggCGCGCTG CTGCAAATCG GTCAGGGcaa cggcttcgtT TCgctGCTGT 

14 51 CGTTTGTTGC CAT C CT G AT T GCCGGCATCA ATATCTTCGG CGGCTTTGCG 

1501 GTTACACGGC GTATGCTGAA TATGTTTAAG AAAGGGTAA 



This corresponds to the amino acid sequence <SEQ ID 638; ORF 155.ng>: 

gl55 . pep 

1 MKIGIPRESL SGETRVACTP AT V AL LGKLG FETWESGAG LAASLDDAAY 

51 QTAGATVADK AAVWACPLIY KVNAPSEGEL PLLKEGQTIV SFLWPRQNEA 

101 LVEALRAKKV N ALAMDMV PR ISRAQALDAL SSMANISGYR AVIEAANAFG 

151 RFFTGQITAA GKVPPA QVLV IGAGVAGLAA IGTAN SLGAV VRAFDTRLEV 

201 AEQIESMGGK FLKLDFLQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAATGG NCELTRPGEL 

301 SVTGNGVKII GYT DMANRLA GQSSQLYATN LVNLTKLLSP NKDGEITLDF 

351 EDVIIRNMTV TRDGEITFPP PPIQVSARPQ QTPSEKAAPA AKPEPKPVPL 

4 01 WKK LAPAAIA AVLVLWVGAV AP AAFLNHFI VFVLACVIG Y HWWNVSHSL 

4 51 HTF LMSVTNA ISGIMWGAL LQIGQGNG FV SLLSFVAILI AGINI FGGFA 

501 VTRRMLNMFK KG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 639>: 

ml55 . seq 

1 ATGAAAATCG GTATCCCACG CGAGTCATTA TCCGGCGAAA CCCGCGTCGC 

51 CTGTACGCCC GCCACCGTCG CCCTGCTGGG CAAACTGGGC TTTGAAACCG 

101 TTGTCGAAAG CGGTGCAGGT TTGGCGGCAA GTTTGGACGA TGCCGCTTAC 

151 CAAACAGCAG GCGCAACCGT TGCCGACAAA GCGGCGGTTT GGGTCTGCCC 

201 TTTGATTTAT AAGGTCAACG CGCCGTCCGA ACAGGAACTG CCGCTTTTGA 

251 ACGAAGGTCA AACCATCGTC AGCTTCCTGT GGCCGCGCCA AAACGAGGCT 
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301 TTGGTCGAAG CCTTGCGCGC CAAGAAAGTG AACGCGCTGG CGATGGATAT 

351 GGTGCCCCGC ATTTCGCGCG CGCAGGCTTT GGACGCTTTG TCTTCGATGG 

4 01 CAAACATCAG CGGCTACCGC GCCGTAATTG AAGCCGCCAA CGCCTTCGGC 

4 51 CGTTTCTTCA CCGGTCAAAT TACCGCCGCC GGCAAAGTGC CGCCCGCGCA 

501 GGTTTTGGTG ATTGGTGCAG GTGTGGCAGG TTTGGCGGCG ATCGGTACGG 

551 CAAACTCGCT CGGCGCAGTG GTACGCGCGT TCGATACCCG CTTGGAAGTG 

601 GCGGAACAAA TCGAATCGAT GGGCGGCAAG TTCCTGAAAC TCGACTTCCC 

651 ACAAGAATCG GGCGGCAGCG GAGACGGCTA CGCCAAAGTG ATGAGCGACG 

701 AATTTATCGC AGCCGAGATG AAGCTCTTTG CCGAGCAGGC GAAAGAAGTG 

7 51 G AC AT CATC A TCACCACCGC CGCCATTCCG GGCAAACCCG CGCCCAAGCT 

801 GATTACCAAA GAAATGGTGG AAAGCATGAA ATCCGGCTCC GTCATCGTCG 

851 ATTTGGCGGC GGCGACGGGC GGCAACTGCG AACTCACCCG CCCGGGCGAA 

901 TTGTCCGTAA CCGGCAACGG CGTGAAAATC ATCGGCTACA CCGACATGGC 

951 AAACCGCCTT GCCGGACAGT CTTCCCAGCT TTACGCCACC AACTTGGTCA 

1001 ACCTGACCAA GCTGTTAAGC CCGAACAAAG ACGGCGAAAT CACGTTGGAC 

1051 TTCGAAGACG TGATTATCCG CAACATGACC GTTACCCACG ACGGCGAAAT 

1101 CACCTTCCCG CCTCCGCCGA TTCAAGTTTC CGCCCAGCCG CAGCAAACGC 

1151 CGTCTGAAAA AGCCGTGCCT GCCGCCAAGC CCGAGCCAAA ACCCGTTCCC 

1201 CTGTGGAAAA AACTCGCGCC CGCCGTCATC GCCGCCGTCT TGGTACTGTG 

1251 GGTCGGCGCG GTCGCACCCG CAGCATTCCT GAACCACTTT ATCGTGTTCG 

1301 TTCTCGCCTG CGTCATCGGC TACTACGTCG TCTGGAACGT CAGCCACTCG 

1351 CTGCACACAC CGCTGATGTC GGTAACCAAC GCCATCTCCG GCATCATCGT 

14 01 CGTCGGCGCG CTGCTGCAAA TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 

14 51 TGTCGTTTGT TGCCATCCTG ATTGCCGGCA TCAACATCTT CGGCGGCTTT 

1501 GCGGTAACAC GGCGTATGCT GAATATGTTT AAGAAAGGGT AA 



This corresponds to the amino acid sequence <SEQ ID 640; ORF 155>: 

ml 55 .pep 

1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QTAGATVADK AAVWVCP LIY KVNAPSEQEL PLLN EGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 

151 RFFTGQITAA GKVPPAQVLV IGAGVAGLAA IGTANSLGAV VRAFDTRLEV 

201 AEQIESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAAATG GNCELTRPGE 

301 LSVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTHDGEITFP PPPIQVSAQP QQTPSEKAVP AAKPEPKPVP 

4 01 LWKK LAPAVI AAVLVLWVGA V AP AAFLNHF IVFVLACVIG YYWWNVSHS 

4 51 LHTP LMSVTN AISGIIWGA L LQIGQGNG F VSLLSFVAIL IAGINI FGGF 

501 AVTRRMLNMF KKG* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 155 shows 97.9% identity over a 513 aa overlap with a predicted ORF (ORF 155.ng) 
from N. gonorrhoeae: 

ml 5 5 / gl55 97 . 9% identity in 513 aa overlap 

10 20 30 40 50 60 

ml55 . pep MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 
I I I t I I M I I I I M I t M I I I I I I I I I M I I I M I II I II I t 1 I M t I I I I I I 11 I I I I I 
g 155 MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAY QTAGATVADK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml55 . pep AAWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 
I I I I : I I I I I I M I I M I I I I I : I I I I I I I I I I I I I I I M I I I i 1 M I I 1 1 I I I I I I I I 
gl55 AAVWACPLIYKVNAPSEGELPLLKEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml55 . pep I SRAQALDALS SMAN I SG YRAV IEAANAFGRFFTGQ ITAAGKVP PAQVLV I GAGVAGLAA 

I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I M I I I I I I M I I II M I II I I I I I II II 
gl55 I SRAQALDALS SMAN I SGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGV AG LAA 
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130 140 150 160 170 180 

190 200 210 220 230 240 

ml55 pep IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 
| 1 M | I I I M I I I M M II I I I I I I 1 I I 1 I I I I I I I I I I I I I I I I I I I I M I I I I I t I I 
ql55 IGTANSLGAVVRAFDTRLEVAEQIESMGGKFLKLDFLQESGGSGDGYAKVMSDEFIAAEM 

190 200 210 220 230 240 

250 260 270 280 290 300 

m!55 pep KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAAATGGNCELTRPGE 

MINI I | I I I I I I I I 1 I I I I M M I IE I I I I 1 I I I I 11 M II 11 I I I I M I I 

g 155 KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVESMKSGSVIVDLAA-TGGNCELTRPGE 

250 260 270 280 290 

310 320 330 340 350 360 

ml55 pep LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 

1 I I I I M I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I M I I I I ! I I M I I II I I I I I 
g 155 LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 
300 310 320 330 340 350 

370 380 390 400 410 420 

ml55 pep VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVIAAVLVLWVGA 

| | : | | M I II I II I I I I I : t I M : I I I I I I I M I I 1 M I I I I I : M M II I I I I I 

gl55 VTRDGEITFPPPPIQVSARPQQTPSEKAAPAAKPEPKPVPLWKKLAPAAIAAVLVLWVGA 
360 370 380 390 400 410 

430 440 450 4 60 470. 480 

ml55 pep VAPAAFLNHFIVFVLACVIGYYWWNVSHSLHTPLMSVTNAISGIIWGALLQIGQGNGF 
I I I M M I I I I I i I M i I I II : I I M I I I M I I I I I I I I I M II I : I I I M M I I I i I I I 
g 155 VAPAAFLNHFIVFVLACVIGYHWWNVSHSLHTPLMSVTNAISGIMWGALLQIGQGNGF 
420 430 440 450 460 470 

490 500 510 

ial55 . pep V S L L S FVAI L I AG INI FGG FAVTRRM LNM FKKGX 

I ! M M I I I I 1 I M M I I I M I I I 1 I I I I I I I I I 
g 155 VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX 
480 490 500 510 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 64 1>: 



al55 . seq 




1 


ATGAAAATCG 


51 


CTGTACGCCC 


101 


TTGTCGAAAG 


151 


CAAGCAGCAG 


201 


TTTAATTTAT 


251 


AAGAAGGACA 


301 


TTGGTCGAAG 


351 


GGTGCCCCGC 


401 


CAAACATCAG 


451 


CGTTTNTTCA 


501 


GGTTTTGGTG 


551 


CAAACTCGCT 


601 


GCGGAACAAT 


651 


GCAAGAATCG 


701 


AATTTATCGC 


751 


GACATCATCA 


801 


NNTNANCAAA 


851 


ATTTGGCGGC 


901 


TTGTTCGTAA 


951 


AAACCGCCTT 


1001 


ACCTGACCAA 


1051 


TTCGAAGACG 


1101 


CACCTTCCCG 


1151 


CGTCTGAAAA 
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1201 CTGTGGAAAA AACTCGCGCC 

1251 GGTCGGCGCG GTCGCACCCG 

1301 TCCTCGCCTG CGTCATCGGC 

1351 CTGCACACAC CGCTGATGTC 

14 01 CGTCGGCGCG CTGCTGCAAA 

14 51 TGTCGTTTGT TGCCATCCTG 

1501 TTTGTAACGC GGCGGATGCT 



CGCCNTNATC GCCGCCGTGT TGGTACTGTG 
CAGCATTCCT GAACCACTTT ATCGTCTTCG 
TACTATGTCG TTTGGAACGT CAGCCACTCG 
GGTGACCAAC GCCATTTCCG GCATCATCGT 
TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 
ATTGCCAGCA TCAACATCTT CGGCGGCTTC 
GAATATGTTT AGGAAAGGGT AA 



This corresponds to the amino acid sequence <SEQ ED 642; ORF 155.a>: 

al55 .pep 

1 MKIGIPRESL S GET R VAC TP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QAAGATVADK AAVWAYPLIY KVNAPSEDEL PLLKEGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDXL SXMANISGYR AVIEAANAFG 

151 RXFTGQITAA GKVPPA QVLV IGAGVAGLAA IGTA NSLGAV VRVFDTRLXV 

201 AEQLESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKXXXK EMVESMKPGS VIVDLAAATG GNCELTKQGE 

301 LFVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTRDGEITFP PPPIQVSAQP QQTPSEKAAP AAKPEPKPVP 

4 01 LWKK LAPAXI AAVLVLWVGA V AP AAFLNHF IVFVLACVIG YYWWNVSHS 

4 51 LHTP LMSVTN AISGIIWGA L LQIGQGNG F VSLLSFVAIL IASINI FGGF 

501 FVTRRMLNMF RKG* 

m!55/al55 95.3% identity in 513 aa overlap 



10 20 30 40 50 60 

ml55 . pep MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 
I I I I I I I I I I I 1 I I i I I 11 t I f I I I I I I I i I I 1 I I I I I I I I 1 M i II I I I I : I I I i M I I 
al55 MKIGIPRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQAAGATVADK 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml55 . pep AAVWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 
till: I I I I I I I I I I I : I I I I I : I I j I I I I II I I M I I 1 I I I II I I I I M I I M I I 1 I t 
a!55 AAVW A Y P L I Y KVN A P S E DE L P LLKE G QT I V S F LW PRQNE AL VE ALRAKKVN AL AM DMV PR 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml55 . pep ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 
I I I I I I I I I I M I I !! I I I I I I I M II 1 I I I I I I II II 1 M I I M I II I I I II I I I I 
al55 ISRAQALDXLSXMANISGYRAVIEAANAFGRXFTGQITAAGKVPPAQVLVIGAGVAGLAA 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml55 . pep IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 
I I I I I I II I I I I : II I II I I II : I I I II I I I I I M I I I I I I I It M II I I I I I I II M I 
al55 IGTANSLGAVVRVFDTRLXVAEQLESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 

190 200 210 220 230 240 



250 260 270 280 290 300 

m!55 . pep KLFAEQAKEVDIIITTAAIPGKPAPKLITKEMVE SMKSGSVIVDLAAATGGNCELTRPGE 

I I M t I I I I I I 1 I M I M I 11 I I I I I MINIMI I I I I I I I I I I I I I I I M I : II 
a!55 KLFAEQAKEVDIIITTAAIPGKPAPKXXXKEMVESMKPGSVIVDLAAATGGNCELTKQGE 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml55 . pep LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLS PNKDGEITLD FEDVIIRNMT 

I 1 I I I I I I I I I I I I I II I I I I II I I I I I I I 1 1 1 I M M I I I I I I I I I I I I I I I I I I I I I 
al55 LFVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 55 . pep VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVIAAVLVLWVGA 
I I : I I I I I I II I M I I I I 1 I I I I I I I I I : I I I I I M I I I I II I II I M I I I II I I II I I 
a!55 VTRDGEITFPPPPIQVSAQPQQTPSEKAAPAAKPE PKPVPLWKK LAP AX I AAVLVLWVGA 

370 380 390 400 410 420 
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430 440 450 460 470 480 

ml55 . pep VAPAAFLNHFIVFVLACVIGYYWWNVSHSLHTPLMSVTNAISGIIWGALLQIGQGNGF 
I I I I M M 1 II M I ! I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I II I I I I I I 1 i 
al55 VAPAAFLNHFIVFVLACVIGYYWWNVSHSLHTPLMSVTNAISGIIWGALLQIGQGNGF 

430 440 450 460 470 480 

490 500 510 

ml55 .pep VSLLSFVAILIAGINIFGGFAVTRRMLNMFKKGX 
I M II I I M I I I : 1 I I I 1 M 111111111:111 
a 1 5 5 VSLLS FVAI L I AS IN I FGGFFVTRRMLNMFRKGX 

490 500 510 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 643>: 

gl56. seq 

1 ATGACTTTCG CCTATTGGTG CATTCTGATT GCCTGCCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCTCG CGGTTTTCTG GCACATACGC AAGGCGCAGC CGCCCGTGCC 

151 CACGCCGCGC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC CGGACAAGCA ACCGTCAACA 

251 CGCTTGCCGG ATTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAC 

301 ATCGCAGACA AAGCAGCATT GCGCTCGCTG ATGTGGGCGG GCGGATTTGC 

351 CTGCACCGTC GGACTGTTTG TCGCGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 644; ORF 156.ng>: 

gl56.pep 

1> MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGAAARA 

51 HAAQQNGFEA FAPFAAAVLT AHATGNAGQA TVNT LAGLFI LFRLAFIWCY 

101 IADKAALRS L MWAGGFACTV GLFVAA A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 645>: 

ml56. seq 

1 ATGACTTTCG CCTATTGGTG TATTCTGATT GCCTGCCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCGCG CGGTTTTCTA GCGCACACGC AAGGCGCAGC CGCCCGTGCC 

151 CACGCCGCAC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCCGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC GGCGCAATCG ACCATCAACA 

251 CGCTTGCCTG CCTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAT 

301 ATCGCCGACA AAGCCGCTAT GCGCTCACTG ATGTGGGCAG GCGGATTTGC 

351 CTGCACCGTC GGGCTGTTTG TCGCGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 646; ORF 156>: 

ml56.pep 

1 MTFAYWCILI ACLLPLFCAA YAKKAGGFRF KDNHNPRGFL AHTQGAAARA 

51 HAAQQNGFEA FAPFAAAVLT AHATGNAAQS TINT LACLFI LFRLAFIWCY 

101 IADKAAMRS L MWAGGFACTV GLFVAA A* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 



ml56 / gl56 96.1% identity in 127 aa overlap 

10 20 30 40 50 60 

ml 5 6 . pep MT FA YW C I L I ACLL P L FC AAYAKKAGG FRFKDNHN PRG FLAHTQG AAARAHAAQQNG FE A 

I I I I II II I I I I I M I I i I I I I I 1 I I I I II I I I I I ! i 11 I I 11 I I I I I I I II I I I I I I II 
gl56 MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml56 . pep FAPFAAAVLTAHATGNAAQSTINTLACLFILFRLAFIWCYIADKAAMRSLMWAGGFACTV 
I II I I I I I I I I I I I I 11 : I : I : M 1 1 M 11 I I I I I I I I II I I I 1 I : 1 M M I M M I I I 
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g 15 6 FAPFAAAVLTAHATGNAGQATVNTLAGLFILFRLAFIWCYIADKAALRSLMWAGGFACTV 

70 80 90 100 110 120 

ml 5 6 . pep GLFVAAAX 
I I I I I I I I 
glS6 GLFVAAAX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 647>: 

al56 . seq 

1 ATGACTTTCG CCTATTGGTG TATTCTGATT GCCTACCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCGCG CGATTTTCTG GCGCGCACGC AAGGCACAGC CGCCCGTGCC 

151 CACGCCGCGC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCAGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC CGGACAAGCA ACCGTCAACA 

251 CGCTTGCCGG CCTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAC 

301 ATCGCAGACA AAGCAGCATT ACGCTCGCTG ATGTGGGTGG GCGGATTTGT 

351 CTGCACCGTC GGGCTGTTTG TCGTGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 648; ORF 156.a>: 

al56 .pep 

1 MTFAYWCILI AYLLPLFCAA YAKKAGGFRF KDNHNPRDFL ARTQGTAARA 

51 HAAQQNG FE A FAPFAAAVLT AHATGNAGQA TVNT LAGLFI LFRLAFIWCY 

101 IADKAALRS L MWVGGFVCTV GLFWA A* 

ml56/al56 90.6% identity in 127 aa overlap 

10 20 30 40 50 60 

ml56.pep MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA 
I M I M I I I I I I I i I I I I I I I I I M I M I I M I I I I E I I : I I I : I I I I I M I I I I 1 I I 
a 156 MT FAYWC I LI AYLLP L FC AAYAKKAGG FRFKDNHNPRDFLARTQGT AARAHAAQQNG FE A 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml56 . pep FAPFAAAVLTAHATGNAAQSTINTLACLFILFRIAFIWCYIADKAAMRSLMWAGGFACTV 
I I I I I I M I M i I I I I I : I : 1 : II I I It I I I I I I I I i I I I I I I I I : I M I I : I I I : I I I 
al56 FAPFAAAVLTAHATGNAGQATVTsITLAGLFI LFRLAFIWCY I ADKAALRSLMWVGGFVCTV 

70 80 90 100 110 120 



ml 5 6. pep GLFVAAAX 
1111=111 
al56 GLFWAAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 649>: 

gl57 . seq 

1 atgaggaacg aggAAAAACg cgccctgcgc cgcgaattgC gCgGgcggcg 

51 ttcgcAAATg GGgcgagacg tGCGggCGGC GGCGgCgatA Aaaatcaacc 

101 gcctgctcaa aCGTtatatc AAGCGCggtc gGaAaatcgG CGTGTATTgg 

151 cCGATGGGCA AGGAATTGcg TTTGGGCGgc tTtgtcCGCG CGGCGCAAAA 

201 ACGCgGCGCA AAactctatc tgccttATAT CGAACCGCAC ACGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GAACGCGGAA TGGAACGGGA ACGCAAGCGC 

301 GGTAGGGCGA AGCTGCATGT CCCTCAGTTT GCAGGGCGCA AAATCCGCGT 

351 GCACGGTTTG TCGGTATTGC TCGTCCCGCT TGTCGGCATA GACCGCGAAG 

4 01 GCTACCGTTT GGGGCAGGCA GGCGGCTATT ACGATGCGAC GCTTTCGGCG 

451 ATGAAATACC GTTTGCAGGC GAAAACCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTGGTGGAC AGGCTCCCAC GCGAGGCGCA CGACCTGCCG CTGGACGGTT 

551 TTGTATCGGA AGCGGGGATA TTGTGTTTTT AG 
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This corresponds to the amino acid sequence <SEQ ID 650; ORF 157.ng>: 

gl57 . pep 

1 MRNEEKRALR RELRGRRSQM GRDVRAAAAI KINRLLKRYI KRGRKIGVYW 

51 PMGKELRLGG FVRAAQKRGA KLYLPYIEPH TRRMWFTPYP ERGMERERKR 

101 GRAKLHVPQF AGRKIRVHGL SVLLVPLVGI DREGYRLGQA GGYYDATLSA 

151 MKYRLQAKTV GVGFACQLVD RLPREAHDLP LDGFVSEAGI LCF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65 1>: 

ml57 . seq 

1 ATGAGGAACG AGGAAAAACG CGCCCTGCGC CGCGAATTGC GCGGGCGGCG 

51 TTCGCAAATG GGGCGGGACG TGCGGGCGGC GGCAACGGTA AAAATCAACC 

101 ACCTGCTCAA ACGTTATATT AAAAAAGGGC GGAAAATCGG CGTGTATTGG 

151 CCGATGGGCA AGGAATTGCG TTTGGACGGC TTTGTCCGCG CGGCGCAAAA 

201 ACGCGGTGCG GAACTCTACC TGCCTTATAT CGAACCGCGT TCGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GCCGATGGAG TAAAACAAGA ACGCAAGCGC 

301 GGTAGGGCGA AGCTGCATGT CCCTCAGTTT GCAGGTCGGA AAAAGCGTGT 

351 GCATGATTTG AACCTCCTGC TTGTGCCAGT GGTCGGTATG GACAGGCTGG 

4 01 GCTACCGCTT GGGACAGGCA GGCGGCTATT ACGATGCGAC GCTTTCAGCG 

451 ATGAAATACC GTTTGCAGGC AAAAACCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTGGTGGAC AGGCTGCCGG TCGAGGCGCA CGACCGGTCT TTGGACGGTT 

551 TTGTGTCGGA GGCGGGG AT A TTGTGTTTTT AG 



This corresponds to the amino acid sequence <SEQ ID 652; ORF 157>: 

ml57 .pep 

1 MRNEEKRALR RELRGRRSQM GRDVRAAATV KINHLLKRYI KKGRKIGVYW 

51 PMGKELRLDG FVRAAQKRGA ELYLPYIEPR SRRMWFTPYP ADGVKQERKR 

101 GRAKLHVPQF AGRKKRVHDL NLLLVPWGM DRLGYRLGQA GGYYDATLSA 

151 MKYRLQAKTV GVGFACQLVD RLPVEAHDRS LDGFVSEAGI LCF* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml57 / gl57 88.1% identity in 193 aa overlap 

10 20 30 40 50 60 

ml57 . pep MRNEEKRALRRELRGRRSQMGRDVRAAATVKINHLLKRYIKKGRKIGVYWPMGKELRLDG 
I I I I I I I I I I I I I I I I Ml I I I I I I I I I :: M I : I 1 I I M I : I I 1 I II I I I I I I M I I i 
gl57 MRNEEKRALRRELRGRRSQMGRDVRAAAAI KINRLLKRYI KRGRKIGVYW PMGKELRLGG 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 57 . pep FVRAAQKRGAELYLPYIEPRSRRMWFTPYPADGVKQERKRGRAKLHVPQFAGRKKRVHDL 
1 I I I I M I I I : M I It I 1 I :: I I It I M I I I ::: I I I I I I I I I t I M I I I I I I I I I 
gl57 FVRAAQKRGAKLYLPYIEPHTRRMWFTPYPERGMERERKRGRAKLHVPQFAGRKIRVHGL 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 57 . pep NLLLVPWGMDRLGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPVEAHDRS 

: : I I I I : M : I I I I I I It I 11 t I I I I I I I i I I I I I I M I I M I I I I I M I I I I I t I 
g 157 SVLLVPLVGI DREGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPREAHDLP 

130 140 150 160 170 180 



190 

ml57 . pep LDGFVSEAGI LC FX 

I I M I I I 1 I I I I I I 
g!57 LDGFVSEAGILCFX 

190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 653>: 

al57 . seq 

1 ATGAGGAACG AGGAAAAACA CGCCTTGCGC CGAGAGTTGC GCCGCGCCCG 
51 CGCGCAGATG GGGCATCAAG GGCGGTTGGC GGCGGGGCAA ACGATTAACC 
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101 GCCTGCTCAA ACGTTATATC AAGCGTGGTC GGAAAATCGG CGTGTATTGG 

151 CCGATGGGCA AGGAATTGCG TTTGGACGGC TTTGTCCGCG CGGCGCAAAA 

201 ACGCGGTGCA AAACTTTATC TGCCTTATAT CGAACCGCGT TCGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GAAAGCGGAA TGGAACGGGA GCGCATACGG 

301 GGCAGGGCGA AGTTGAACGT GCCGCAGTTT GCAGGGCGCA AAATCCGCGT 

351 GCACGGTTTG TCGGTATTGC TCGTCCCGCT TGTCGGCATA GACCGCGAGG 

401 GCTACCGCTT AGGACAGGCA GGCGGCTATT ACGATGCGAC GCTTGCGGCG 

4 51 ATGAAATACC GTTTGCAGGC AAAAACCGTG GGCGTGGGCT TTGCCTGCCA 

501 GTTTGTGGAC AGGCTGCCGC GCGAACCGCA CGATCTGCTG CTGGACGGTT 

551 TTGTGTCGGA GGCGGGGATA TTGTGCTTTT AG 



This corresponds to the amino acid sequence <SEQ ID 654; ORF 157.a>: 

al57 .pep 

1 MRNEEKHALR RELRRARAQM GHQGRLAAGQ TINRLLKRYI KRGRKIGVYW 

51 PMGKELRLDG FVRAAQKRGA KLYLPYIEPR SRRMWFTPYP ESGMERERIR 

101 GRAKLNVPQF AGRKIRVHGL SVLLVPLVGI DREGYRLGQA GGYYDATLAA 

151 MKYRLQAKTV GVG FACQFVD RLPREPHDLL LDGFVSEAGI LCF* 

ml57/al57 82.4% identity in 193 aa overlap 



10 20 30 40 50 60 

ml57 . pep MRNEEKRALRRELRGRRSQMGRDVRAAATVKINHLLKRYIKKGRKIGVYWPMGKELRLDG 
I 1 I I I I : I I II I I I I : I I I : : I El I I : II II I I I : I I I I I I I I I I I I I I I I M 
al57 MRNEEKHALRRELRRARAQMGHQGRLAAGQTINRLLKRYIKRGRKIGVYWPMGKELRLDG 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml57 . pep FVRAAQKRGAELYLPYIEPRSRRMWFTPYPADGVKQERKRGRAKLHVPQFAGRKKRVHDL 
I I i I I I II I I : I I I I I II I I I I I I I 1 I I 11 :!:::!! I I I I 1 I : I I I I I M I IN I 
al57 FVRAAQKRGAKLYLPYIEPRSRRMWFTPYPESGMERERIRGRAKLNVPQFAGRKIRVHGL 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml57 . pep NLLLVPWGMDRLGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPVEAHDRS 

: : I I M : I I : I I II I I I I I I II I I i M : I I I I i I I I I ! I I I I I II I : I I I I I I II 
al57 SVLLVPLVGIDREGYRLGQAGGYYDATLAAMKYRLQAKTVGVGFACQFVDRLPREPHDLL 

130 140 150 160 170 180 



190 

ml57 .pep LDGFVSEAGILCFX 
I II I I I I I M I I I I 
a 157 LDGFVSEAGILCFX 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 655>: 

gl58 . seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGgagcAGTT GGAGAtggCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGCGT GAAcCTGCtc 

151 aACCGCACCA CGCGGCAACT CAATCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAGTA CCGCAAGGCG TGTTGCGCGT GGATTCCGCG 

301 ATGCcgatgg TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATT CAACG A 

351 ACGCTATCCG CATATCcgaC TTTCGCTCGT TTCTTCCGAa ggctatatca 

4 01 atctGattGA Acgcaaagtc gAtatTGCCT TACGGGCCGG AGAATTGGAC 

451 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCACT TCCGCGtagt 

501 cgCCAGTCCT GAATATTTAG CAAAACACGG CACGCCACAA TCTGCAGAAG 

551 atcTTGCCAA CCATCAATGT TTAGGCTTCA CAGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGAtgC GCAGGGAAAT CCCTATAAAA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAAGtt 

7 01 gCGGTATTGC TTGCTTATCA GATTTTTTGG TTGACAACGA CATCACTGAA 

751 GGAAAGTTAA TTCCcctatt cgCCGAACAA ACCTCCAATA AAACACACCC 
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801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAACCTC CGCTTACGCG 
851 TATTTTTGGA TTTTTTAGTG AAGGAACTGG GAAAAAATAT GAATAGAACG 
901 AATACCAAAT AA 



This corresponds to the amino acid sequence <SEQ ID 656; ORF 158.ng>: 

gl58 -pep 

1 MKTNSEELTV FVQWESGSF SRA AEQLEMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLNLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEV PQGVLRVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSHFRVVASP EYLAKHGTPQ SAEDLANHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEILR SLC LSSCGIACLS DFLV DNDITE 

251 GKLIPLFAEQ TSNKTHPFNA VYYSDKAVNL RLRVFLDFLV KELGKNMNRT 

301 NTK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 657>: 

ml58 . seq 

- 1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGGAGCAGTT GGCGATGGCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGTGT GAACCTGCTC 

151 AACCGCACCA CGCGGCAACT CAGTCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAATA CCGCAAGGCG TGTTGAGCGT GGATTCCGCG 

301 ATGCCGATGG TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCCGAC TTTCGCTCGT TTCTTCCGAA GGCTATATCA 

401 ATCTGATTGA ACGCAAAGTC GATATTGCCT TACGGGCCGG AGAATTGGAC 

451 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCGCT TCCGCGTAAT 

501 CGCCAGTCCT GAATACCTGG CAAAACACGG CACGCCGCAA TCTACAGAAG 

551 AGCTTGCCGG CCACCAATGT TTAGGCTTCA CCGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGATGC GCAGGGAAAT CCCTATAAGA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAGGTT 

701 GCGGTATTGT TTGCTTATCA GATTTTTTGG TTGACAACGA CATCGCTGAA 

751 GGAAAGTTAA TTCCCCTGCT CGCCGAACAA ACCTCCGATA AAACACACCC 

801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAATCTC CGCTTACGCG 

851 TATTTTTGGA TTTTTTAGTG GAGGAACTGG GAAACAATCT CTGTGGATAA 



This corresponds to the amino acid sequence <SEQ ID 658; ORF 158>: 

ml 58 .pep 

1 MKTNSEELTV FVQWESGSF SRAA EQLAMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLSLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEI PQGVLSVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSRFRVIASP EYLAKHGTPQ STEELAGHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEILR SLC LSGCGIVCLS DFLV DNDIAE 

251 GKLIPLLAEQ TSDKTHPFNA VYYSDKAVNL RLRVFLDFLV EELGNNLCG* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml58 / gl58 94.3% identity in 2 97 aa overlap 

10 20 30 40 50 60 

ml 58 . pep MKTNSEELTVFVQVVESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
I I I I I I I I I I I I I I I I I I I I 1 I 1 1 ! I I I I 11 I I I I I t I I I I I I I I I M I M I I I II : I I 
gl58 MKTNSEELTV FVQVVESGSFSRAAEQLEMANSAVSRIVKRLEEKLGVNLLNRTTRQLNLT 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 58 . pep EEGAQYFRRAQRILQEMAAAETEMLAVHEIPQGVLSVDSAMPMVLHLLAPLAAKFNERYP 
I I I I I I I II I M I I I I I I I I I I I I I I M I : I [ I I I I I I I I I I II I I I I I M I I M I I I I 
gl58 EEGAQYFRRAQRILQEMAAAETEMLAVHEVPQGVLRVDSAMPMVLHLLAPLAAKFNERYP 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml 58 . pep HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
M I I I I I II M I I I 1 M t 1 I I I I I I I I I I I I I I I I I I I I I I I : I I t : I ! I I I I I I I M I I 
gl58 HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSHFRVVASPEYLAKHGTPQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml58 . pep STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
I : I : I I : I I I f j I II I I I I 1 I I I II I I I I I I I I I I I I I I I I t I I i I I I I I M : I I I : I I I 
gl58 SAEDLANHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSSCGIACLS 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml58 . pep DFLVDNDIAEGKLIPLLAEQTSDKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 

I I M I I I I : I I I I I I I : I I I I I : II II I II I II I I I I I I I I M I I I I M I : I I I : I : 
gl58 DFLVDNDITEGKLIPLFAEQTSNKTHPFNAVYYSDKAVNLRLRVFLDFLVKELGKNMNRT 

250 260 270 280 290 300 

gl58 NTKX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 659>: 

al58 . seq 

1 AT G AAAAC C A ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGGAGCAGTT GGCGATGGCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGTGT GAACCTGCTC 

151 AACCGCACCA CGCGGCAACT CAGTCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAATA CCGCAAGGCG TGTTGCGCGT GGATTCCGCG 

301 ATGCCGATGG TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCCGAC TTTCGCTCGT TTCTTCCGAA GGCTATATCA 

4 01 AT CTGATTGA ACGCAAAGTC GATATTGCCT TACGGGCCGG AGAATTGGAC 

451 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCGCT TCCGCGTAAT 

501 CGCCAGTCCT GAATACCTGG CAAAACACGG CACGCCGCAA TCTACAGAAG 

551 AGCTTGCCGG CCACCAATGT TTAGGCTTCA CCGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGATGC GCAGGGAAAT CCCTATAAGA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAGGTT 

7 01 GCGGTATTGC TTGCTTATCA GATTTTTTGG TTGACAACGA CATCGCTGAA 

751 GGAAAGTTAA TTCCCCTGCT CGCCGAACAA ACCTCCAATA AAACGCACCC 

801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAACCTC CGCTTACGCG 

851 TATTTTTGGA TTTTTTAGTG GAGGAACTGG GAAACAATCT CTGTGGATAA 

This corresponds to the amino acid sequence <SEQ ID 660; ORF 158.a>: 

alSS.pep 

1 MKTNSEELTV FVQWESGSF SRA AEQLAMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLSLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEI PQGVLRVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSRFRVIASP EYLAKHGTPQ STEELAGHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEILR SLC LSGCGIACLS DFLV DNDIAE 

251 GKLIPLLAEQ TSNKTHPFNA VYYSDKAVNL RLRVFLDFLV EELGNNLCG* 

ml58/al58 99.0% identity in 299 aa overlap 

10 20 30 40 50 60 

ml 58 . pep MKTNSEELTVFVQWESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
I I I I I I I I I I I 1 I I I I I j I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I 1 I I I I 
al58 MKTNSEELTVFVQWESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml58 . pep EEGAQYFRRAQRILQEMAAAETEMIAVHEIPQGVLSVDSAMPMVLHLLAPLAAKFNERYP 
I It I I I I I I I I I I I I I I I I I I I I I I I I I I I It I I I I I II I II I I I I I I I I M I I I I II I 
al58 EEGAQYFRRAQRILQEMAAAETEMLAVHEIPQGVLRVDS/^MPMVLHLLAPLAAKFNERYP 

70 80 90 100 110 120 
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130 140 150 160 170 180 

ml58 . pep HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
I I I I I I M I I I It 1 I I I I I I 1 I I I I I I I I I I M I I I I i I It M M I I i I I I I I I I I I I M 
al58 HIRLSLVSSEGYINLIERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml58 . pep STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
I i M I I I I I I I 1 J I I I I I I I I I i I I I I I I M I I I I I I I I 1 I I I I I i I I I M I 1 I I I : I I I 
a 1 5 8 STEELAGHQCLGFTE PGSLNT WAVLDAQGNP YKI S PHFTAS SGE I LRS LCLSGCG I ACLS 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml58 . pep DFLVDNDIAEGKLIPLLAEQTSDKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 
I I I I I I I I I I I I I I I I I MM I : I II I II I II I I I I M II I I I I I I I I I ! I M I M I II I 
al58 DFLVDNDIAEGKLIPLLAEQTSNKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 

250 260 270 280 290 300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 66 1>: 

gl60. seq 

1 ATGGAcattc tgGACAAact ggtcgatCTC GCccaATTGA CGGGCAGTGC 

51 GGATGTGCAG TgcctTTTGG GCGGACAATG gcATGaaacc TTGCAACGCG 

101 AAGGGCTGGT AC AC AT T G T T ACGGCGGGCA GCGGTTATCT CTGCATCGAC 

151 GGCGAAACTT CCCCGCGTCC GGTCGGCACG GGCGATATTG TATTTTTCCC 

201 GCGCGGCTTG GGTCATGTGT TGAGCCACGA CGGAAAATAC GGAGAAAGTT 

251 TACAACCGGA CATACGACAA AACGGCACAT TTATGGTCAA ACAGTGCGGC 

301 AACGGGCTGG ATATGAGCCT GTTTTGCGCC CGTTTCCGCT ACGACACCCA 

351 CGCCGATTTG ATGAACGGGC TGCCGGAAAC CGTTTTTCTG AACATTGCCC 

4 01 ATCCAAGTTT GCAGTATGTG GTTTCAATGC TGCAACTGGA AAGCGAAAAA 

4 51 CCTTT GACGG GGACGGTTTC CGTGGTCAAC GCATTACCGT CCGTCCTGCT 

501 GGTGCTTATC CTGCGCGCCT ATCTCGAACA GGATAAGGAT GTCGAACTCT 

551 CGGGCGTATT GAAAGGTTGG CAGGACAAAC GTTTGGGACA TTTGATCCAA 

601 AAGGTGATAG ACAAACCGGA AGACGAATGG AATATTGACA AAATGGTTGC 

651 CGCCGCCAAT ATGTCGCGCG CGCAACTGAT GCGCCGCTTC AAAAGCCAAG 

701 TCGGACTCAG CCCGCACGCC TTTGTGAACC ATATCCGCCT GCAAAAAGGC 

7 51 GCATTGCTGC TGAAGAAAAC CCCGGATTCG GTTTTGGAGG TCGCGCTGTC 

801 GGTGGGCTTT CAGTCGGAAA CGCATTTCGG CAAGGCGTTC AAACGGCAAT 

851 ATCACGTTTC GCCGGGGCAA TACCGGAAAG AAGGCGGGCA AAAATAA 



This corresponds to the amino acid sequence <SEQ ID 662; ORF 160.ng> : 



gl60.pep 

1 MDILDKLVDL AQLTGSADVQ CLLGGQWHET LQREGLVHIV TAGSGYLCID 

51 GETSPRPVGT GDIVFFPRGL GHVLSHDGKY GESLQPDIRQ NGTFMVKQCG 

101 NGLDMSLFCA RFRYDTHADL MNGLPETVFL NIAHPSLQYV VSMLQLESEK 

151 PLTG TVSWN ALPSVLLVLI L RAYLEQDKD VELSGVLKGW QDKRLGHLIQ 

201 KVIDKPEDEW NIDKMVAAAN MSRAQLMRRF KSQVGLSPHA FVNHIRLQKG 

251 ALLLKKTPDS VLEVALSVGF QSETHFGKAF KRQYHVSPGQ YRKEGGQK* 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 663>: 

ml60 . seq 

1 ATGGACATTC TGGACAAACT GGTCGATTTC GCCCAATTGA CGGGCAGTGT 

51 GGATGTGCAG TGCCTTTTGG GCGGACAATG GTCGGTACGG CATGAAACCT 

101 TGCAACGCGA AGGATTGGTA CACATTGTTA CATCGGGCAG CGGCTATCTC 

151 TGCATCGACG GCGAAACTTC CCCGCGTCCG GTCAGTACAG GGGATATTGT 

201 ATTTTTCCCG CGCGGCTTGG GTCATGTGTT GAGCCACGAC GGAAAATGCG 

251 GAGAAAGTTT ACAACCGGAT ATGCGGCAGC ACGGTGCGTT TACGGTCAAG 

301 CAGTGCGGCA ACGGACAGGA TAT GAG C CT G TTTTGCGCCC GTTTCCGCTA 

351 CGACACCCAC GCCGATTTGA TGAACGGGCT GCCTGAAACC GTTTTTCTGA 

4 01 ACATTGCCCA TCCGAGTTTA CAGTATGTGG TTTCAATGCT GCAACTGGAA 

4 51 AGCAAAAAAC CTTTGACGGG GACGGTTTCC ATGGTCAACG CATTGTCGTC 
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501 CGTCCTGCTG GTGCTTATCC TGCGCGCCTA TCTCGAACAG GATAAGGATG 

551 TCGAACTCTC GGGCGTATTG AAAGGTTGGC AGGACAAACG TTTGGGACAT 

601 TTAATCCAAA AGGTGATAGA CAAACCGGAA GACGAATGGA ATGTCGACAA 

651 AATGGTGGCG GCTGCCAATA TGTCGCGCGC GCAACTGATG CGCCGTTTCA 

701 AAAGCCGGGT CGGACTCAGC CCGCACGCCT TTGTGAACCA TATCCGCCTG 

751 CAAAAAGGCG CGTTGCTGCT GAAAAAAAAC CCGGATTCGG TTTTGTCGGT 

801 CGCACTGTCG GTAGGCTTTC AGTCGGAAAC GCACTTCGGC AAGGCGTTCA 

851 AACGGCAATA TCACGTTTCG CCGGGTCAAT ACCGGAAAGA AggCGGGCAA 

901 AAATAA 



This corresponds to the amino acid sequence <SEQ ID 664; ORF 160>: 

ml 60 .pep 

1 MDILDKLVDF AQLTGSVDVQ CLLGGQWSVR HETLQREGLV HIVTSGSGYL 
51 CIDGETSPRP VSTGDIVFFP RGLGHVLSHD GKCGESLQPD MRQHGAFTVK 
101 QCGNGQDMSL FCARFRYDTH ADLMNGLPET VFLNIAHPSL QYWSMLQLE 
151 SKKPLTG TVS MVNALSSVLL VLILRAYLEQ DKDVELSGVL KGWQDKRLGH 
201 LIQKVIDKPE DEWNVDKMVA AANMSRAQLM RRFKSRVGLS PHAFVNHIRL 
2 51 QKGALLLKKN PDSVLSVALS VGFQSETHFG KAFKRQYHVS PGQYRKEGGQ 
301 K* 

Computer analysis of this amino acid sequence gave the following results : 
Homology with a predicted ORF from K gonorrhoeae 

ml60 / gl60 93.4% identity in 301 aa overlap 

10 20 30 40 50 60 

MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 
I | 1 I I I I I I : I I 1 I It : I I I I I I I I I i I I I I I I I I M I M i : I I I I I I I I t I 1 I I I I 

MDILDKLVDLAQLTGSADVQCLLGGQW HETLQREGLVHIVTAGSGYLCIDGETSPRP 

10 20 30 40 50 



ml 60 .pep 
gl60 



70 80 90 100 110 120 

ml 60 pep VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 
I : | | I ! I II I I I I M I M I I I I I M I I I I : I I : I : I I I 1 I I M I I I I I I I I I I I I I I 
g 160 VGTGDIVFFPRGLGHVLSHDGKYGESLQPDIRQNGT FMVKQCGNGLDMSL FCARFRYDTH 

60 70 80 90 100 110 

130 140 150 160 170 180 

ml 60 pep ADLMNGLPET V FLN I AH PSLQYWSMLQLESKKPLTGTVSMVNALSSVLLVL I LRAYLEQ 

I I I I M I I I I I I I I M M I [ I I I I I I I I t II : I I I I I I f I : E M 1 I I I I I I I I I II I I I 
g l60 ADLMNGLPETVFLNIAHPSLQYWSMLQLESEKPLTGTVSVVNALPSVLLVLI LRAYLEQ 

120 130 140 150 160 170 

190 200 210 220 230 240 

ml 60 pep DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKMVAAANMSRAQLMRRFKSRVGLS 

I | M I II I I M I I I M I I I I M I I 1 It I I I I I I I : M I I I I II II I II I I I I I I I : I I II 
g!60 DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNIDKMVAAANMSRAQLMRRFKSQVGLS 

180 190 .200 210 220 230 

250 260 270 280 290 300 

ml 60 pep PHAFVNHIRLQKGALLLKKNPDSVLSVALSVGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 

M I I I I I II I I I M I M I I : I I I I I I I I I I II I I I M 1 I M I I I M M I I I I I I I II I I 
g l60 PHAFVNHIRLQKGALLLKKTPDSVLEVALSVGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 

240 250 260 270 280 290 



ml 60. pep KX 
I I 

gl60 KX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 665>: 

al60.seq 

1 ATGGACATTC TGGACAAACT GGTCGATTTC GCCCAATTGA CGGGCAGTGT 
51 GGATGTGCAG TGCCTTTTGG GCGGACAATG GTCGGTACGG CATGAAACCT 
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101 TGCAACGCGA AGGATTGGTA CACATTGTTA CATCGGGCAG CGGCTATCTC 

151 TGCATCGACG GCGAAACTTC CCCGCGTCCG GTCAGTACAG GGGATATTGT 

201 ATTTTTCCCG CGCGGCTTGG GTCATGTGTT GAGCCACGAC GGAAAATGCG 

251 GAGAAAGTTT ACAACCGGAT ATGCGGCAGC ACGGTGCGTT TACGGTCAAG 

301 CAGTGCGGCA ACGGACAGGA TATGAGCCTG TTTTGCGCCC GTTTCCGCTA 

351 CGACACCCAC GCCGATTTGA TGAACGGGCT GCCTGAAACC GTTTTTCTGA 

401 ACATTGCCCA TCCGAGTTTA CAGTATGTGG TTTCAATGCT GCAACTGGAA 

4 51 AGCAAAAAAC CTTTGACGGG GACGGTTTCC ATGGTCAACG CATTGTCGTC 

501 CGTCCTGCTG GTGCTTATCC TGCGCGCCTA TCTCGAACAG GATAAGGATG 

551 TCGAACTCTC GGGCGTATTG AAAGGTTGGC AGGACAAACG T T T G GG AC AT 

601 TTAATCCAAA AG GT G AT AG A CAAACCGGAA GACGAATGGA ATGTCGACAA 

651 AATGGTGGCG GCTGCCAATA TGTCGCGCGC GCAACTGATG CGCCGTTTCA 

701 AAAGCCGGGT CGGACTCAGC CCGCACGCCT TTGTGAACCA TATCCGCCTG 

751 CAAAAAGGCG CGTTGCTGCT GAAAAAAAAC CCGGATTCGG TTTTGTCGGT 

801 CGCACTGTCG GTAGGCTTTC AGTCGGAAAC GCACTTCGGC AAGGCGTTCA 

851 AACGGCAATA TCACGTTTCG CCGGGTCAAT ACCGGAAAGA AGGCGGGCAA 

901 AAATAA 

This corresponds to the amino acid sequence <SEQ ID 666; ORF 160.a>: 

al60. pep 

1 MDILDKLVDF AQLTGSVDVQ CLLGGQWSVR HETLQREGLV HIVTSGSGYL 

51 CIDGETSPRP VSTGDIVFFP RGLGHVLSHD GKCGESLQPD MRQHGAFTVK 

101 QCGNGQDMSL FCARFRYDTH ADLMNGLPET VFLNIAHPSL QYWSMLQLE 

151 SKKPLTG TVS MVNALSSVLL VLIL RAYLEQ DKDVELSGVL KGWQDKRLGH 

201 LIQKVIDKPE DEWNVDKMVA AANMSRAQLM RRFKSRVGLS PHAFVNHIRL 

251 QKGALLLKKN PDSVLSVALS VGFQSETHFG KAFKRQYHVS PGQYRKEGGQ 

301 K* 

ml60/al60 100.0% identity in 301 aa overlap 

10 20 30 40 50 60 

MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 
1 I I I M M I I I M I I I I I I I I I I I I I ! I I I I I I I I 1 M I I I It I M I I I I I 1 I I I I I M I 
MDILDKLVDFAQLTGSVDVQCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 
10 20 30 40 50 60 

70 80 90 100 110 120 

VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSL FCARFRYDTH 

I I I I I I I ! M II I I I t I I I I ! I M I i It II M I 1 I I I I M I I I E I I I I M I M I I I I I I I 
VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSLFCARFRYDTH 
70 80 90 100 110 120 

130 140 150 160 170 180 

ADLMNGLPETVFLNIAHPSLQYWSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 
M I I I M I I M I I I t I I I I I I I 1 I 1 1 I I I I I 1 I i I I I I I I I M I II I M I I I I I i I I I M 
ADLMNGLPETVFLNIAHPSLQYWSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 
130 140 150 160 170 180 

190 200 210 220 230 240 

DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKMVAAANMSRAQLMRRFKSRVGLS 

t M I I M I I I I I I I I I I I I I I M I I II I I II I 1 II II I I II I I M I I I I I I I II I I M I I 
DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKMVAAANMSRAQLMRRFKSRVGLS 
190 200 210 220 230 240 

250 260 270 280 290 300 

PHAFVNHIRLQKGALLLKKNPDSVLSVALSVGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 

! | | I I I I I M I I I I I I I I I I I I M I I I I I I I I It I I I M I I I M I II I I I I M I I I M I I 
PHAFVNHIRLQKGALLLKKN PDSVLSVALS VGFQSETHFG KAFKRQYHVS PGQYRKEGGQ 
250 260 270 280 290 300 



ml 60 . pep 
al60 

ml 60 .pep 
al60 

ml 60 . pep 
al60 

ml 60 .pep 
al60 

ml 60 . pep 
al60 



ml 60 .pep 
al60 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 667>: 

gl61 . seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTCACCGTTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTACGC TCGGTGCTGC CGCCGTATTG CGGCGCGACA CCTTCCGCAC 

201 GCCCCATTGG AAAAAC C AC T TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGAC AACCGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTttg GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

4 01 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

4 51 CCGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGCAACC GGCGTGGCGA TGTCGTCggt ttgggcgacg 

601 Ctgaccggct ggCACAcccT GTCCTTTcca tcggcagttt ATCtgtCGGG 

651 CATCGGCGTG tccgcgCtgA TTGCCCAaCT GtcgatgAcg cGCGcctaca 

7 01 aaGTCGGCGA CAAATTCACG GTTGCCTCGC tttcctaTAt gaccgtcGTC 

7 51 TTTTCCGCCC TGTCTGCCGC ATTTTTTCTg ggcgaagagc ttttctggCA 

801 GGAAATACTC GGTATGTGCA TCATTATcct CAGCGGCATT TTGAGCAGCA 

851 TCCGCCCCAT TGCCTTCAAA CAGCGGCTGC AAGCCCTCTT CCGCCAAAGA 

901 TAA 



This corresponds to the amino acid sequence <SEQ ID 668; ORF 161.ng>: 

gl61 -pep 

1 MDTAKKDILG SGWMLVAAA C FTVMNVLIKE ASAKFALGSG ELVFWRMLFS 

51 TVTLGAAAVL RRDT FRTPHW KNHLNRS MVG TGAMLLLFYA VTHL PLTTGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQA VL LLGFAGWLL LNPSF RSGQE 

151 PAALAGLAGG AMSGWAYLKV RELSLAGEPG WRVVFYLSAT GVAMSSVWAT 

201 LTGWHTLS FP SAVYLSGIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTW 

251 FSALSAAFFL GEELFWQ EIL GMCIIILSGI LSSI RPIAFK QRLQALFRQR 

301 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 669>: 

ml 61 . seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTTACCATTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTGCGC TCGGGGCTGC CGCCGTATTG CGTCGGGACA mCTTCCGCAC 

201 GCCCCATTGG AAAAAC C ACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGGC CACTGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTTTG GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

4 51 ACGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGTGACA GGTGTGGCGA TGTCGTCGGT TTGGGCGACG 

601 CTGACCGGCT GGCACACCCT GTCCTTTCCA TCGGCAGTTT ATCTGTCGTG 

651 CATCGGCGTG TCCGCGCTGA TTGCCCAACT GTCGATGACG CGCGCCTACA 

7 01 AAGTCGGCGA CAAATTCACG GTTGCCTCGC TTTCCTATAT GACCGTCGTT 

7 51 TTTTCCGCTC TGTCTGCCGC ATTTTTTCTG GGCGAAGAGC TTTTCTGGCA 

801 GGAAATACTC GGTATGTGCA TCATCATCCT CAGCGGTATT TTGAGCAGCA 

851 TCCGCCCCAC TGCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 

901 TAA 



This corresponds to the amino acid sequence <SEQ ID 670; ORF 161>: 

ml 61 .pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG EL VFWRMLFS 

51 TVALGAAAVL RRDX FRTPHW KNHLNRSMVG TGA MLLLFYA VTHLPLATGV 

101 TLSYTSSIFL AVFSFLILKE RISVYTQA VL LLGFAGWLL LNPSF RSGQE 
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151 TAALAGLAGG AMSGWAYLKV RELSLAGEPG WR WFYLSVT GVAMSSVWA T 

201 LTGWHTLS FP SAVYLSCIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTVV 

251 FSALSAAFFL GEELFWQ EIL GMCIIILSGI LSSI RPTAFK QRLQSLFRQR 

301 * 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml61 / gl61 97.0% identity in 300 aa overlap 



10 20 30 40 50 60 

ml 61 . pep MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 
I I I I 1 I I I I I I 11 I I I It I II I : I I 1 I I I I I I I I I I I M I I II M I I I I I I I : I I M I I t 
gl61 MDTAKKDILGSGWMLVAAACFTVMNVLIKEASAKFALGSGELVFWRMLFSTVTLGAAAVL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 61 .pep RRDXFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 
I I I : I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I : I I I II I I I I II I I I I I I I I I I I I 
gl61 RRDTFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLTTGVTLSYTSSIFLAVFSFLILKE 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml61 . pep RISVYTQAVLLLGFAGVVLLLNPSFRSGQETAALAGLAGGAMSGWAYLKVRELSLAGEPG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I M 1 I I I I I M I M I I I 1 I I I II M 
g!61 RISVYTQAVLLLGFAGWLLLNPSFRSGQEPAALAGLAGGAMSGWAYLKVRELSLAGEPG 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 61 . pep WRWFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMT RAYKVGDKFT 

I I I I I I II : I ! M [ I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
gl61 WRWFYLSATGVAMSSVWATLTGWHTLSFPSAVYLSGIGVSALIAQLSMTRAYKVGDKFT 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml61 . pep VASLSYMTWFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 
I I I I I I I I i I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I II M I I I II : I I I M 
gl61 VASLSYMTWFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPIAFKQRLQALFRQR 

250 260 270 280 290 300 



ml 61. pep X 
I 

gl61 X 



The following partial DNA sequence was 

al61 . seq 

1 ATGGATACCG CAAAAAAAGA 

51 GGCGGCCTGC TTTACCATTA 

101 AATTTGCCCT CGGCAGCGGC 

151 ACCGTTGCGC TCGGGGCTGC 

201 GCCCCATTGG AAAAAC C AC T 

251 TGCTGCTGCT GTTTTACGCG 

301 ACCCTGAGTT ACACCTCGTC 

351 TTTGAAAGAA CGGATTTCCG 

401 TTGCCGGCGT GGTATTGCTG 

4 51 ACGGCGGCAC TCGCCGGGCT 

501 TTTGAAAGTG CGCGAACTGT 

551 TGTTTTACCT TTCCGTGACA 

601 CTGACCGGCT GGCACACCCT 

651 CATCGGCGTG TCCGCGCTGA 

7 01 AAGTCGGCGA CAAATTCACG 

751 TTTTCCGCTC TGTCTGCCGC 

801 GGAAATACTC GGTATGTGCA 



identified in N. meningitidis <SEQ ID 67 1>: 



CATTTTAGGA 
TGAACGTATT 
GAATTGGTCT 
CGCCGTATTG 
TAAACCGCAG 
GTAACGCATC 
GATTTTTTTG 
TTTACACGCA 
CTTAATCCCT 
GGCGGGCGGC 
CTTTGGCGGG 
GGTGTGGCGA 
GTCCTTTCCA 
TTGCCCAACT 
GTTGCCTCGC 
ATTTTTTCTG 
TCATCATCCT 



TCGGGCTGGA 
GATTAAAGAG 
TTTGGCGCAT 
CGTCGGGACA 
TATGGTCGGG 
TGCCTTTGGC 
GCGGTATTTT 
GGCGGTGCTG 
CGTTCCGCAG 
GCGATGTCCG 
CGAACCCGGC 
TGTCATCGGT 
TCGGCAGTTT 
GTCGATGACG 
TTTCCTATAT 
GCCGAAGAGC 
CAGCGGTATT 



TGCTGGTGGC 
GCATCGGCAA 
GCTGTTTTCA 
CCTTCCGCAC 
ACGGGGGCGA 
CACCGGCGTT 
CCTTCCTGAT 
CTCCTTGGTT 
CGGTCAGGAA 
GCTGGGCGTA 
TGGCGCGTCG 
TTGGGCGACG 
ATCTGTCGTG 
CGCGCCTACA 
GACCGTCGTT 
TTTTCTGGCA 
TTGAGCAGCA 
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851 TCCGCCCCAC TGCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 
901 TAA 

This corresponds to the amino acid sequence <SEQ ID 672; ORF 161.a>: 

al61.pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG EL VFWRMLFS 

51 TVALGAAAV L RRDTFRTPHW KNHLNRSMVG TGA MLLLFYA VTHLPLATGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQ AVL LLGFAGWLL LNFSF RSGQE 

151 TAALAGLAGG AMSGWAYLKV RELSLAGEPG WR WFYLSVT GVAMSSVWA T 

201 LTGWHTLS FF SAVYLSCIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTW 

251 FSALSAAFFL AEELFWQ EIL GMCIIILSGI LSSI RPTAFK QRLQSLFRQR 

301 * 

ml61/al61 99.3% identity in 300 aa overlap 

10 20 30 40 50 60 

MDTAKKDILGSGWMLVAAACFTIMNVLIKEAS7VKFALGSGELVFWRMLFSTVALGAAAVL 

| I I I I I I I I M M i I I I I I I I I I M I I I ! I M I I I M I I I I I I I I I I 

MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 
10 20 30 40 50 60 

70 80 90 100 110 120 

RRDXFRT PHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 
I I I : t I M I! I I (I I I! M I I I I I I I I I I M I I I I I I I I I M I 1 I 1 M M 1 1 1 I I M I I I 
RRDTFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 
70 80 90 100 110 120 

130 140 150 160 170 180 

RISVYTQAVLLLGFAGWLLLNPSFRSGQETAALAGLAGGAMSGWAYLKVRELSLAGEPG 

I | ! | | | | | I I I I I I I I I ! I M I I I I I M I I I I I ! I I I I I I M I I I I I I 11 I I I II 

RISVYTQAVLLLGFAGWLLLNPSFRSGQETAALAGLAGGAMSGWAYLKVRELSLAGEPG 

130 140 150 160 170 180 

190 200 210 220 230 240 

WRWFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 

I I I 1 I I M I 11 II II I E I II I I I 1 I I i I 1 I M II I M I I 11 I I I I II I I I M I I I I I I I I 
WRWFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 
190 200 210 220 230 240 

250 260 270 280 290 300 

VASLSYMTVVFSALSAAFFLGEELFWQEILGMCI I ILSGILSS IRPTAFKQRLQSLFRQR 

I I I I I I I | I I I I I I : I I I I I I I I I I II I 1 I I I II I I M I I I E I I I I II I I I I i I 

VASLSYMTW FSALSAAFFLAEELFWQEILGMCI I ILSGILSS IRPTAFKQRLQSLFRQR 
250 260 270 280 290 300 



X 
I 

X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 673>: 

gl63 . seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG gctCGACCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTt 

151 ctgGGTTTtc t get GAT ACT CTCGGTCAGC GGTTTGGGAA ACATcagGCT 

201 AGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGCC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA T G CAT T ATT T TTCGGACATT ACGGTCGGCG CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

401 CCTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 



ml 61 .pep 
al61 
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4 51 CGCTACAAAC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATGGGCTGG ATTGCCGAAA ACAGCTTCGG 

651 CGTGCAGGTC TTGATTATCG CCGCCGTAAT GTCCCTCGCC GTCGTTTCGG 

7 01 CAATATCCGG CGTGGGGAAG GGCGTGAAGG TGTTGAGCGA GTTGAACCTG 

751 GGCCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG ACCCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTGAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGgc 

951 gcCGTTTGTG GGTTTGTTTA TCGCGCGCAT TTCAAAGGGg cgcaccatCc 

1001 gcgagtttgt CTTCGGGGTT TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTTGC 

1101 GGGGGGAATG CTCGAAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCGAAC TGACGAGCAT CGTCAGCCTG 

1201 CTGGTCATTT CCCTGTTTTT TGTAACTTCT GCCGACTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAAAGG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGatgtcTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

14 01 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGCT 

14 51 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTCAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCGGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACATACC GCATCGCCCG 

1601 CTATGCACGA GTTGCAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGT CGATA AGATGTTTCA TCAGGACGAG CCCGCAATCG AGTTCGTCAT 

1701 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

1751 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1801 CACCAGACAA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC GACATTTTGA 

1901 AAAACTACGA ACGTTATTTG ATGTTGTTGG ATGATGTCGG TCAGGAACTG 

1951 ATGGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 

This corresponds to the amino acid sequence <SEQ ID 674; ORF 163.ng>: 

gl63 .pep 

1 M VILTTLFFV CVLWLVL TV PDQVQMWLDR AKEVIFTEFS WFY VLTFSIF 

51 LGFLLILSVS GLGNIRLGRD EDVPEFGFLS WL AMLFAAGM GVGLMFFGVA 

101 EPLMHYFSDI TVGAPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLGF GASQ 

201 LGAGLQEMGW IAENSFGVQ V LIIAAVMSLA VVSAIS GVGK GVKVLSE LNL 

251 GLAFLLLFFV LAAD PTVYLL SAFGDNIGNY LGNLVRLSLK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGM LEKMTSSPET LLFKFFNYLP LPELTSIVSL 

401 LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PR WQAVMWGV LMSAVAVLLM 

4 51 RSGGLGNLQS M TLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVRIMSQTQE QDILKFLKHT ASPAMHELQR ELSEEYGLSV 

551 RVDKMFHQDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 675>: 

ml 63 . seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG GCTCGATCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTT 

151 CTGGGTTTCC TGCTGATACT CTCGGTCAGC AGTTTGGGAA ACATCAGGCT 

201 CGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGTC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGACATT ACGGCCGGCA CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

4 01 CTTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 

4 51 CGCTACAAGC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATGGGCTGG ATTGCCGAAA ACAGCTTCAG 
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651 CGTGCAGGTT TTGATTATCG CCGCCGTCAT GTCCCTCGCC GTCGTTTCGG 

701 CAATATCCGG CGTGGGGAAG GGCGTGAAGG TGTTGAGCGA GTTGAACCTG 

751 GGCCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG GACCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTTAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGGC 

951 GCCGTTTGTG GGTTTGTTTA TCGCGCGCAT TTCAAAGGGG CGCACCATCC 

1001 GCGAGTTTGT CTTCGGGGTT TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTTGC 

1101 GGGGGGAATG CTCGAAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCGAAT TGACGAGCAT CGTCAGCCTG 

1201 CTGGTCATTT CTCTGTTTTT TGTAACTTCT GCCGATTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAAAGG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGATGTCTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

14 01 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGCT 

14 51 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTTAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCAGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACAGACT GCATCGCCCG 

1601 CTATGCACGA GTTGCAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGTCGATA AAATGTTTCA TCGGGACGAG CCCGCAATCG AGTTCGTCAT 

17 01 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

17 51 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1801 CATCAGACAA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC G AC AT T T T G A 

1901 AAAACTACGA ACGTTATTTG ATGTTGTTGG ATGATGTCGG TCAGGAACTG 

1951 ATGGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 



This corresponds to the amino acid sequence <SEQ ID 676; ORF 163>: 

ml 63 . pep 

1 M VILTTLFFV CVLWLVL TV PDQVQMWLDR AKE VIFTEFS WFYVLTFSIF 

51 LGFLLILSVS SLGNIRLGRD EDVPE FGFLS WL AMLFAAGM GVGLMFFGV A 

101 EPLMHYFSDI TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLGF GASQ 

201 LGAGLQEMGW IAENSFSVQ V LI I AA VMS LA WSAIS GVGK GVKVLSELNL 

251 GLAFLLLFFV LAAG PTVYLL SAFGDNIGNY LGNLVRLSFK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGM LEKMTSSPET LLFKFFNYLP LPELTSIVSL 

4 01 LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM . 

451 RSGGLGNLQS M TLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVQIMSQTQE QDILKFLKQT AS PAMHELQR ELSEEYGLSV 

551 RVDKMFHRDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml 6 3 / gl63 98.6% identity in 660 aa overlap 

10 20 30 40 50 60 

ml 63 .pep MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 
I I t I It E I I I I ! I I I I I II I 1! I I I 1 I I! I I i I I I I I I I M M M I I I I I I I I I I I I I I I 
gl63 MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 63. pep SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 
: I I I I 1 I I It t I I I I I I I I I It I I I I I M 1 I M I I 1 I I I I I i M i I I M II : I : I I I I I I 
g!63 GLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITVGAPEHRQQ 

70 80 90 100 110 120 



130 140 150 160 170 180 
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ml 63 . pep QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 

I ! 1 M I I I I M I I I I I 1 M t I M It I I M M I M I I I M ! I II I I M i M I I 1 I I I I I I I 
g 163 QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 63. pep MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFSVQVLIIAAVMSLAWSAISGVGK 

II I I I I I I I II I I I I I II I I I I I M I I I I M I I I I I : I I I I I I I I I I I I I I I 1 I M M I I 
gl63 MALIATFFGIITTLGFGASQLGAGLQEMGWIAENSFGVQVLIIAAVMSLAWSAISGVGK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 63 . pep GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 
! I I I I I i I I I I I I I I I I I I I I I I I II I I I I! I I II I I I I I I I I I I I I : I I I I I M I M ! 
g 163 GVKVLSELNLGLAFLLLFFVLAADPTVYLLSAFGDNIGNYLGNLVRLSLKTYAYEREHKP 

250 260 27 0 2 80 290 300 

310 320 330 340 350 360 

ml63 . pep WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 
I I I I I I M I I I M 1 I I I I I I I I i I I I I M I I I I I I I I I I M I M II I I I I I I I I I I I I I I 
g 163 WFESWTVLYWAWWCSWAPFVGLFIARISKGRTIREFVFGVLLIPGLFGVLWFTVFGNTAI 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml63 . pep WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

I I I I M I I I I I I I I I II I I I I M I I I I I I I M I I I 1 I I I I I II I I I I I I I I I It I I I I I I 
g 163 WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 63 . pep ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 

I I II I I I I ! M II I I I I I I I I I I I i I II I I I I I I I I I M M I I I II M I I I I I I I M I I I 
gl63 ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 63 . pep WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKQTAS PAMHELQR 

M I I I I I I II I I I I I I M I I I I I I I I II I I I I : I I I I I I I M II I i I I : I I I I I I I I I I I 
gl63 WKGLSADKKYFETRVNPTSVFWTGGPCWKERLVRIMSQTQEQDILKFLKHTASPAMHELQR 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 63. pep ELSEEYGLSVRVDKMFHRDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

I I I I M M I I I I I I I I I : I M M I I I I I I I II I I II I I I M M ! I I I I 1 I I I I I I I I I I I 
gl63 ELSEEYGLSVRVDKMFHQDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

550 560 57 0 580 590 600 

610 620 630 640 650 660 

ml 63 . pep HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQEL^47VHEQVELAE 

I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I II i I I M I I I 
g!63 HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 

610 620 630 640 650 660 



ml 63. pep X 
I 

gl63 X 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 677>: 

al63 . seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG GCTCGATCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTT 

151 CTGGGTTTCC TGCTGATACT CTCGGTCAGC AGTTTGGGAA ACATCAGGCT 
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201 CGGACGGGAT GAAGATGTGC CGGAATT CGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGTC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGAC ATT ACGGCCGGCA CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

4 01 CTTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 

451 CGCTACAAGC TGCCGCTTGC CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 

501 AGAAAAAATT TCCGGAAGGT TCGGCGATGC CATTGATATT ATGGCGTTGC 

551 TTGCTACTTT TTTCGGCATC ATCACCACAT TGGGGTTCGG GGCTTCGCAA 

601 CTGGGCGCCG GATTGCAGGA AATAGGCTGG ATTGCCGAAA ACAGCTTCAG 

651 CGTGCAGGTT TTGATTATCG CCGCCGTCAT GTCCCTCGCC GTCGTTTCGG 

701 CAATATCCGG CGTGGGGAAG GGTGTGAAGG TGTTGAGCGA GTTGAACCTG 

751 GGTCTTGCGT TTTTGCTGCT GTTTTTTGTT TTGGCGGCGG GTCCCACTGT 

801 TTACCTGTTG TCGGCATTCG GCGACAACAT AGGGAACTAC CTCGGAAATC 

851 TGGTGCGCCT CAGTTTTAAA ACTTATGCGT ACGAACGGGA ACACAAGCCG 

901 TGGTTTGAAT CTTGGACGGT GCTTTATTGG GCGTGGTGGT GTTCTTGGGC 

951 GCCGTTTGTG GGTTTGTTTA TCGCGCGCAT TTCAAAGGGG CGCACCATCC 

1001 GCGAGTTTGT CTTCGGGGTT TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 

1051 TGGTTTACCG TCTTCGGCAA TACGGCGATT TGGCTGAATG ACGGGGTTGC 

1101 GGGGGGAGTG CTCGAAAAGA TGACCTCCTC TCCGGAAACG CTGCTTTTTA 

1151 AATTCTTTAA TTACCTCCCC CTGCCCGAAT TGACGAGCAT CGTCAGCCTG 

1201 CTGGTCATTT CTCTGTTTTT TGTAACTTCT GCCGATTCCG GGATTTATGT 

1251 CCTGAACAAT ATTACCTCTC GGGACAAAGG CTTGAGCGCG CCACGGTGGC 

1301 AGGCGGTTAT GTGGGGCGTG CTGATGTCTG CCGTTGCCGT TTTGCTGATG 

1351 CGCTCGGGCG GACTCGGCAA CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 

14 01 GCCGTTTGCC CTGCTGATGC TGATAATGTG TTTCAGCCTG TGGAAAGGAT 

14 51 TGAGTGCGGA TAAGAAATAT TTTGAGACCC GGGTTAACCC TACCAGTGTA 

1501 TTTTGGACGG GCGGCAAGTG GAAAGAACGG CTGGTGCAGA TAATGAGCCA 

1551 GACGCAGGAG CAGGATATTT TAAAATTCCT CAAACATACC GCATCGCCCG 

1601 CTATGCACGA GTTACAACGG GAGCTTTCGG AAGAATACGG CTTGAGCGTC 

1651 CGGGTCGATA AGATGTTTCA TCAGGACGAG CCCGCAATCG AGTTCGTCAT 

17 01 TCGGAAAGAG ACGATGCGCG ATTTTATGTA CGGGATTAAG TCTGTCGGGC 

1751 AGGATGTATC CGACCAGTTG ATTAACGACG GCAAGCTGCC GCATATCCGG 

1801 CAT C AG AC AA CTTACAAACC CTACGCTTAT TTTTTCGACG GGCGCGTCGG 

1851 GTACGATGTG CAGTATATGA ACAAGGACGA GCTGATTGCC GACATTTTGA 

1901 AAAACT AC G A ACGTTATTTG ATGTTGTTGG AT GAT GT CGG TCAGGAACTG 

1951 AT GGCGCACG AGCAGGTGGA ATTGGCAGAG TAA 

This corresponds to the amino acid sequence <SEQ ID 678; ORF 163.a>: 

al63 .pep 

1 M VILTTLFFV CVLWLV.L TV PDQVQMWLDR AKEVIFTEFS WFYVLTFSIF 

51 LGFLLILSVS SLGNIRLGRD EDVPE FGFLS W LAMLFAAGM GVGLMFFGV A 

101 EPLMHYFSDI TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAI DI MALLATFFGI ITTLGF GASQ 

201 LGAGLQEIGW IAENSFSVQ V LIIAAVMSLA WSAIS GVGK GVKVLSE LNL 

251 GLAFLLLFFV LAAG PTVYLL SAFGDNIGNY LGNLVRLSFK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGV LEKMTSSPET LLFKFFNYLP LPE LTSIVSL 

401 LVISLFFVTS ADSGIYVLNN 1TSRDKGLSA PRW QAVMWGV LMSAVAVLLM 

4 51 RSGGLGNLQS MTLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVQIMSQTQE QDILKFLKHT ASPAMHELQR ELSEEYGLSV 

551 RVDKMFHQDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 

ml63/al63 99.4% identity in 660 aa overlap 

10 20 30 40 50 60 

ml 63. pep MVILTTLFFVCVLVVLVLTVPDQVQMWLDRAKEVI FTEFSWFYVLTFS I FLGFLLILSVS 
I I I I I M M I I I I ! I I I I I I I I I M M I I I I I I I I I 1 I I I f I f I I I I I I t II I I I I I It I 

al 63 MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVI FTEFSWFYVLTFS I FLGFLLILSVS 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 63 . pep SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 
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I I I ] M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M t I I I I I I I II I I I I I I I M I I I 
al63 SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 63 - pep QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 

II I I I I I I I II I I I I I I I I I I I I I I I M I I I I I I I I I M I 1 t I M I I I I I I I I I I I I I It 
a 163 QALLHTVFHWGVHAWSVYGTIALALAYFGFRYKLPLALRSCFYPLLKEKISGRFGDAIDI 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 63 .pep MALLATFFGIITTLGFGASQLGAGLQEMGWIAENSFSVQVLIIAAVMSLAWSAISGVGK 
I I I I I M I I I I I I I I I M I I I I I I I t I : I I I I I I I I I I M I I I I I I I I I M I I 1 I I I I t I 
al63 MALLATFFGIITTLGFGASQLGAGLQEIGWIAENSFSVQVLIIAAVMSLAWSAISGVGK 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 63 . pep GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 
I I I I I I I I I I II M I I I I I I It I I I I I I I I I I I I I I I I 11 I I I I I I I I I U I I M I I I I I 
a 163 GVKVLSELNLGLAFLLLFFVLAAGPTVYLLSAFGDNIGNYLGNLVRLSFKTYAYEREHKP 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 63. pep WFESWTVLYWAWWCSWAPFVGLFIARI SKGRTIREFVFGVLLI PGLFGVLWFTVFGNTAI 

I | M I | I I I I I I If I 1 I I I t I I I I t I I I I I I I I I M I I I I I I I I I I I I I I I I M I I M I I 
al63 WFESWTVLYWAWWCSWAPFVGLFIARI SKGRTIREFVFGVLLI PGLFGVLWFTVFGNTAI 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml63 . pep WLNDGVAGGMLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

I | | | | | | | I : I II I I I I I I I M I I I I I I I I I I M I I II I I I I I I I t M I I I I 1 I I I I I I I 
al63 WLNDGVAGGVLEKMTSSPETLLFKFFNYLPLPELTSIVSLLVISLFFVTSADSGIYVLNN 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 63 . pep ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 
I I I I M I M I I I I I I I I I M I 1 M I I I I M I I I I M I M I t I I I I ! I I I I I t I 11 I M I I 
a 163 ITSRDKGLSAPRWQAVMWGVLMSAVAVLLMRSGGLGNLQSMTLIVSLPFALLMLIMCFSL 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 63. pep WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKQTAS PAMHELQR 

It I I I I I M I I I I I I M I I I I I I I It I I I I I I I It I M I ! I I : I I t I I I I I I I I 

al63 WKGLSADKKYFETRVNPTSVFWTGGKWKERLVQIMSQTQEQDILKFLKHTAS PAMHELQR 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 63 . pep ELSEEYGLSVRVDKMFHRDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 
I | M I I M I I I I M 1 I I : I M II I I I ! I I I I M I I I I I t I I I I t I I 1 I 1 I t I I I I I I I I I 
al63 ELSEEYGLSVRVDKMFHQDEPAIEFVIRKETMRDFMYGIKSVGQDVSDQLINDGKLPHIR 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 63 . pep HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 

I t I I 1 I I I i I I I t I It I M I M I I I I I 1 I I I I t I I I 1 It I I I II I I I I I I I I I I I I I I I I 
a 163 HQTTYKPYAYFFDGRVGYDVQYMNKDELIADILKNYERYLMLLDDVGQELMAHEQVELAE 

610 620 630 640 650 660 



ml 63 .pep 
al63 



X 
1 

X 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 679>: 

gl64.seq (partial) 

1 . . ATGAACACAT TTTTGAAAAA CAGCGAATAC GCGTATATCC TGAACGACTG 

51 CAAGGCGCGC TTCCTGTTCG CCTCGGCCGG CCTGTCAAAA GAATTGGCGG 

101 GCCTGAAGGC GCAAACGCCC GTCGAAAAAA TCATTTGGAC GGACAAAAGC 

151 CGGCCGGCCG GCGAAACGGC GGAAGGCGAT GCCTTTTTTG AAAACGTGCG 

201 CCGCTTCCCC GAAAAAGCCG ACTTGGGCCG CCAACCCCGG ATAAATGATT 

251 TGGCACACAT CATCTACACC TCCGGCACGA CGGGGCATCC CAAAGGCGCG 

301 CTAATCAGTT ACGCCAACCT GTTCGCCAAC CTGAACGGCA TCGAACGCAT 

351 CTTtaaAATT TCCAAACGCG ACCGCTTTAT CGTTTTCctg ccgatgTTCC 

4 01 ACAGCTTCAC GCTGACGGCT ATGGTGCTGC TGCCGATTTA TATGGCGTGT 

451 TCGATTATTT TGGTCAAAtC cgttttCCCc ttttccaacG TTTTGAAACA 

501 GGCCCTGCTC AAACGCGCAA CCGTGTTTTT GGGCGTACCC GCGATTTACA 

551 CCGCGATGAG CAAGGCAAAA ATCCCTTGGT ATTTCAGATG GTTCAACCGC 

601 ATCCGCCTGT TTATCAGCGG CGGCGCGCCT TTGGCGGAAC AAACCATCCT 

651 CGATTTTAAA GCCAAGTTCC CCCGCGCCAA ATTGCTGGAA GGCTACGGAC 

701 TGAGCGAAGC CTCGCCCGTC GTCGCCGTCA ATACGCCCGA ACGGCAAAAA 

751 GCCCGCAGCG TCGGCATCCC CCTGCCCGGT TTGGAAGCCA AAGCCGT CGA 

801 TGAAGAATTG GTCGAAGTGC CGCGCGGCGA AGTGGGCGAA CTGATCGTCA 

851 GGGGCGGTTC GGTGATGCGG GGCTACCTCA ATATGCCTGC CGCCACCGAT 

901 GAAACCATCG TCAACGGCTG GTTGAAAACG GGCGATTTCG TTACCATAGA 

951 CGAGGACGGC TTTATCTTTA TCGTCGACCG CAAAAAAGAT TTGATTATTT 

1001 CCAAAGGTCA AAACGTCTAT CCGCGCGAGA TCGAAGAAGA AATCCACAAA 

1051 CTCGATGCCG TCGAAGCCGC CGCCGTCATC GGCGTGAAAG ACCGTTATGC 

1101 CGACGAGGAA ATCGTCGCCT TCGTCCAATT GAAGGAAGGT ATGGATTTGG 

1151 GCGAGGACGA aatccgccgc caccTGCGTA CCGTGCTGGC AAATTT C AAA 

1201 ATCCCCAAAC AGATCCACTT TAAAGACGGG CTGCCGCGCA ACGCTACGGG 

1251 CAAAGTATTG AAACGGGTGC TGAAGGAGCA GTTTGAAGGA AACAAATGA 

This corresponds to the amino acid sequence <SEQ ID 680; ORF 164.ng>: 

gl64.pep (partial) 

1 . . MNTFLKNSEY AYILNDCKAR FLFASAGLSK ELAGLKAQTP VEKIIWTDKS 

51 RPAGETAEGD AFFENVRRFP EKPDLGRQPR INDLAHIIYT SGTTGHPKGA 

101 LISYANLFAN LNGIERIFKI SKRDRFIVFL PMFHSFTL TA MVLLPIYMAC 

151 SIILV KSVFP FSNVLKQALL KRATVFLGVP AIYTAMSKAK IPWYFRWFNR 

201 IRLFISGGAP LAEQTILDFK AKFPRAKLLE GYGLSEASPV VAVNTPERQK 

251 ARSVGIPLPG LEAKAVDEEL VEVPRGEVGE LIVRGGSVMR GYLNMPAATD 

301 ETIVNGWLKT GDFVTIDEDG FIFIVDRKKD LIISKGQNVY PRE IEEE I HK 

351 LDAVEAAAVI GVKDRYADEE IVAFVQLKEG MDLGEDEIRR HLRTVLANFK 

401 IPKQIHFKDG LPRNATGKVL KRVLKEQFEG NK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 681>: 

ml 64 . seq 

1 ATGAACCGGA CTTATGCCAA TTTCTACGAA ATGCTCGCCG CCGCCTGCCG 

51 CAAAAACGGA AACGGCACGG CAGTGTTCGA CGGCAAGGAA AAAACCGCCT 

101 ACCGCGCGCT CAAGCAGGAG GCCGAAGCCG TCGCGGCGTA TCTGCAAAAT 

151 ATCGGCGTGA AGTTCGGCGA CACGGTCGCG CTGGCGGTTT CCAATTCCAC 

201 AGAATTTATT ACCGCCTATT TCGCCATCTC CGCCATCGGC GCGGTCGCCG 

251 T AC C GAT G AA CACATTTTTG AAAAAC AG C G AATACGCGTA TATCCTGAAC 

301 GACTGCAAGG CGCGCTTCCT GTTCGCCTCG GCCGGCCTGT CAAAAGAATT 

351 GGCGGGCTTG AAGG C G C AAA CGCCCGTCGA AAAAAT C AT T TGGACGGACA 

4 01 AAAGCCGTCC GACCGGCGAA ACGGCGGAAG GCGATGCCTT TTTTGAAGAC 

4 51 GTGCGCCGCT TCCCCGAAAA ACCCGACTTG GGCCGCCAAC CCCGGATAAA 

501 TGATTTGGCA CACATCATCT ACACCTCCGG CACGACGGGG CATCCCAAAG 

551 GCGCGCTAAT CAGTTACGCC AACCTGTTCG CCAACCTGAA CGGCATCGAA 

601 CGCATCTTTA AAATTT CCAA GCGCGACCGC TTTATCGTTT TCCTGCCGAT 

651 GTTCCACAGC TTCACGCTGA CGGCTATGGT GCTGCTGCCG ATTTATATGG 

7 01 CGTGTTCGAT TATTTTGGTC AAATCCGTTT TTCCGTTTTC CAACGTTTTG 

751 AAACAGACAC TGCTCAAACG CGCGACCGTG TTTTTGGGCG TACCCGCGAT 

801 TTACACCGCG ATGAGCAAGG CGAAAATCCC TTGGTATTTC AGATGGTTCA 

851 ACCGCATTCG CCTGTTTATC AGCGGCGGCG CGCCTTTGGC GGAACAAACC 

901 ATCCTCGATT TCAAAGCCAA GTTCCCCCGC GCCAAATTGC TGGAAGGCTA 



WO 99/57280 



PCT/US99/09346 



466 



951 CGGACTGAGC GAAGCCTCTC CCGTCGTCGC CGTCAATACG CCCGAGAGGC 

1001 AAAAAGCCCG CAGCGTCGGC ATCCCCCTGC CCGGTTTGGA AGCCAAAGCC 

1051 GT CGAT GAAG AATTGGTCGA AGTGCCGCGC GGCGAAGTGG GCGAACTGAT 

1101 CGTCAGGGGC GGTTCGGTGA TGCGGGGCTA CCTCAATATG CCTGCCGCCA 

1151 CCGATGAAAC CATCGTCAAC GGCTGGTTGA AAACGGGCGA TTTCGTTACC 

1201 ATAGACGAAG ACGGCTTTAT CTTTATCGTC GACCGCAAAA AAGATTTGAT 

1251 TATTTCCAAA GGTCAAAATG TCTATCCGCG CGAGATTGAA GAAGAAATCT 

1301 ACAAACTCGA TGCCGTCGAA GCCGCCGCCG TCATCGGCGT GAAAGACCGT 

1351 TATGCCGACG AGGAAATCGT CGCCTTCGTC CAATTGAAGG AAGGTATGGA 

14 01 TTTGGGCGAG AACGAAATCC GCCGCCACCT GCGTACCGTG CTGGCAAATT 

14 51 TCAAAATCCC CAAACAAATC CACTTTAAAG ACGGGCTGCC GCGCAACGCT 

1501 ACGGGCAAGG TATTGAAACG GGTGTTGAAG GAGCAGTTTG ACGGAAACAA 

1551 ATGA 

This corresponds to the amino acid sequence <SEQ ID 682; ORF 164>: 

ml 64 -pep 

1 MNRTYANFYE MLAAACRKNG NGTAVFDGKE KTAYRALKQE AEAVAAYLQN 

51 IGVKFGDTVA LAVSNSTE FI TAYFAI SAIG AVAVP MNTFL KNSEYAYILN 

101 DCKARFLFAS AGLSKELAGL KAQTPVEKII WTDKSRPTGE TAEGDAFFED 

151 VRRFPEKPDL GRQPRINDLA HIIYTSGTTG HPKGALISYA NLFANLNGIE 

201 RIFKISKRDR FIVFLPMFHS FTL TAMVLLP IYMACSIILV KSVFPFSNVL 

251 KQTLLKRATV FLGVPAIYTA MSKAKIPWYF RWFNRIRLFI SGGAPLAEQT 

301 ILDFKAKFPR AKLLEGYGLS EASPWAVNT PERQKARSVG IPLPGLEAKA 

351 VDEELVEVPR GEVGELIVRG GSVMRGYLNM PAATDETIVN GWLKTGDFVT 

401 IDEDGFIFIV DRKKDLIISK GQNVYPREIE EEIYKLDAVE AAAVIGVKDR 

4 51 YADEEIVAFV QLKEGMDLGE NEIRRHLRTV LANFKIPKQI HFKDGLPRNA 

501 TGKVLKRVLK EQFDGNK* 



Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae ml64 / gl64 98.6% identity in 432 aa 

overlap 

60 70 80 90 100 110 

GDTVALAVSNSTEFI TAYFAI SAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSK 

I I I I I I I I I I I I I I i M I I I I I I M I I I M 
MNTFLKNSEYAY I LN DCKARFLFAS AGLSK 
10 20 30 



ml 64 .pep 
gl64 



120 130 140 150 160 170 

ml64 .pep ELAGLKAQTPVEKIIWTDKSRPTGETAEGDAFFEDVRRFPEKPDLGRQPRINDLAHIIYT 
I I I I I I I j j I I I I I I I I I I I I I : I M I I M 1 I f I : I I ! M I M 1 I I ! M 1 I M II I I I ! I 
g 164 ELAGLKAQTPVEKI IWTDKSRPAGETAEGDAFFENVRRFPEKPDLGRQPRINDLAHI I YT 

40 50 60 70 80 90 



180 190 200 210 220 230 

ml 64 .pep SGTTGHPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHS FTLTAMVLLP I YMAC 

I | | 1 | i I M I I I I I I It I I I t I I I I I I I I I I I I 1 I I I I M I M I M M 1 I II I i I II M I 
g 164 SGTTGHPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMAC 

100 110 120 130 140 150 



240 250 260 270 280 290 

ml 6 4 . pep S 1 1 LVKSVFPFSNVLKQTLLKRATVFLGVPAI YTAMSKAKI PWYFRWFNRIRLFI SGGAP 

I M I I II I I I M I I I I I : I I II I I I I I M I I I I I I I 11 I 1 I M I I I I I I i I I I II I M I I 
g 164 S I ILVKS VFPFSNVLKQALLKRATVFLGVPAI YTAMSKAKI PWYFRWFNRIRLFISGGAP 

160 170 180 190 200 210 



300 310 320 330 340 350 

ml64 .pep LAEQTILDFKAKFPRAKLLEGYGLSEASPWAVNTPERQKARSVGIPLPGLEAKAVDEEL 
I I I I M I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I 1 I I I I I I 
gl64 LAEQTILDFKAKFPRAKLLEGYGLSEASPWAVMTPERQKARSVGIPLPGLEAKAVDEEL 

220 230 240 250 260 270 

360 370 380 390 400 410 
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ml 64 .pep VEVPRGEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 
M I I I I I I I I I I ! I II I I I I I I i I I I I I I ! I M I I M I 1 I I M I I I I I I I I I I I I I I i M 
g 164 VEVPRGEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 

280 290 300 310 320 330 

420 430 440 450 460 470 

ml 64 . pep LIISKGQNVYPREIEEEIYKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRR 
I I I I I I I I I I I I I I I I II : i I I M I I I I I I I I II I I I I I I I I I I I I M I I I I I I I : I I I I 
g!64 L 1 1 SKGQNV Y PRE I EEE I HKLDAVE AAAVI GVKDRYADEE I VAFVQLKEGMDLGEDE I RR 

340 350 360 370 380 390 



480 490 500 510 

ml 64 . pep HLRTVLAN FKI PKQ I HFKDGLPRNATGKVLKRVLKEQFDGNKX 

I M I II II I M I I I I I 1 I I II M II I I I I M I I M I I I : I I I I 
g 1 6 4 HLRTVLAN FKI PKQ I HFKDGLPRNATGKVLKRVLKEQFEGNKX 

400 410 420 430 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 683>: 



al64 . seq 




1 


ATGAACCGGA 


51 


CAAAAACGGC 


101 


ACCGCGCGCT 


151 


ATCGGCGTGA 


201 


GGAATTTATT 


251 


TACCGAT GAA 


301 


GACTGCAAGG 


351 


GGCGGGCTTG 


401 


AAAGCCGTCC 


451 


GTGCGCCGCT 


501 


TGATTTGGCA 


551 


GTGCGCTAAT 


601 


CGCATCTTTA 


651 


GTTCCACAGC 


701 


CGTGTTCGAT 


751 


AAACAGGCAC 


801 


TTACACCGCG 


851 


ACCGCATCCG 


901 


ATCCTCGATT 


951 


CGGACTGAGC 


1001 


AAAAAGCCCG 


1051 


GTCGATGAAG 


1101 


CGTCAGGGGC 


1151 


CCGATGAAAC 


1201 


ATAGACGAAG 


1251 


TATTTCCAAA 


1301 


ACAAACT CGA 


1351 


TATGCCGACG 


1401 


TTTGGGCGAG 


1451 


TCAAAATCCC 


1501 


ACGGGCAAGG 


1551 


ATGA 



CTTATGCCAA TTTCTACGAA 
AACGGCACGG CAGTGTTCGA 
CAAGCAGGAA GCCGAAGCCG 
AGTTCGGCGA CACGGTCGCG 
ACCGCCTATT TCGCCGTATC 
CACATTTTTG AAAAACAGCG 
CGCGCTTCCT GTTCGCCTCG 
AAGGCGCAAA CGCCCGTCGA 
GGACGGCGAA ATGGCGGAAG 
TCCCCGAAAA ACCCGACTTG 
CACATCATCT ACACCTCCGG 
CAGCTACGCC AACCTGTTCG 
AAATCTCCAA GCGCGACCGC 
TTCACGCTGA CGGCTATGGT 
TATTTTGGTC AAATCCGTTT 
TGCTCAAACG CGCGACCGTG 
ATGAGCAAGA CGAAAATCCC 
CCTGTTTATC AGCGGCGGAG 
TCAAAGCCAA GTTCCCCCGC 
GAAGCCTCGC CCGTCGTCGC 
CAGCGTCGGC ATCCCCCTGC 
AATTGGTCGA AGTGCCGCGC 
GGTTCGGTGA TGCGGGGCTA 
CATCGTCAAC GGCTGGTTGA 
ACGGCTTTAT CTTTATCGTC 
GGTCAAAATG TCTATCCGCG 
TGCCGTCGAA GCCGCCGCCG 
AGGAAATCGT CGCCTTCGTC 
AACGAAATCC GCCGCCACCT 
CAAACAAATC CACTTTAAAG 
TATTGAAACG GGTGTTGAAG 



ATGCTGACCG CCGCCTGCCG 
CGGCAAGGAA AAAACCGCCT 
TTGCGGCGTA TCTGCAAAAT 
CTGGCGGTTT CCAATTCCAC 
CGCCATCGGC GCGGTTGCCG 
AATACGCGTA TATCCTGAAC 
GCCGGCCTGT CAAAAGAATT 
AAAAATCATT TGGACGGGCC 
GCGATGCCTT TTTTGAAGAC 
GGCCGCCAAC CCCGGATAAA 
CACGACGGGG CATCCCAAAG 
CCAACCTGAA CGGCATCGAA 
TTTATCGTTT TCCTGCCGAT 
GCTGCTGCCG ATTTATATGG 
TCCCCTTTTC CAACGTTTTG 
TTTTTGGGCG TGCCCGCGAT 
TTGGTATTTC AGATGGTTCA 
CACCTTTGGC GGAACAAACC 
GCCAAATTGC TGGAAGGCTA 
CGTCAATACG CCCGAGAGGC 
CCGGTTTGGA AGTCAAAGCC 
GGCGAAGTGG GCGAACTGAT 
CCTCAATATG CCTGCCGCCA 
AAACGGG CGA TTTCGTTACC 
GACCGCAAAA AAGATTTGAT 
CGAAATCGAA GAAGAAATCT 
TCATCGGCGT GAAAGACCGT 
CAATTGAAGG AAGGTATGGA 
GCGTACCGTG CTGGCAAATT 
ACGGGCTGCC GCGCAACGCT 
GAGCAGTTTG ACGGAAACAA 



This corresponds to the amino acid sequence <SEQ ID 684; ORF 164.a>: 

al64 -pep 

1 MNRTYANFYE MLTAACRKNG NGTAVFDGKE KTAYRALKQE AEAVAAYLQN 

51 IGVKFGDTVA LAVSNSTE FI TAYFAVSAIG AVAVP MNTFL KNSEYAYILN 

101 DCKARFLFAS AGLSKELAGL KAQTPVEKII WTGQSRPDGE MAEGDAFFED 

151 VRRFPEKPDL GRQPRINDLA HIIYTSGTTG HPKGALISYA NLFANLNGIE 

201 RIFKISKRDR FIVFLPMFHS FTL TAMVLLP IYMACSIILV KSVFPFSNVL 

251 KQALLKRATV FLGVPAIYTA MSKTKIPWYF RWFNRIRLFI SGGAPLAEQT 

301 ILDFKAKFPR AKLLEGYGLS EASPWAVNT PERQKARSVG IPLPGLEVKA 

351 VDEELVEVPR GEVGELIVRG GSVMRGYLNM PAATDETIVN GWLKTGDFVT 

401 IDEDGFIFIV DRKKDLIISK GQNVYPREIE EEIYKLDAVE AAAVI GVKDR 

4 51 YADEEIVAFV QLKEGMDLGE NEIRRHLRTV LAN FKI PKQ I HFKDGLPRNA 
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501 TGKVLKRVLK EQFDGNK+ 
ml 64 /al 64 98 . 3% identity in 517 aa overlap 

10 20 30 40 50 60 

ml 64 . pep MNRT YAN FYEMLAAACRKNGNGT AV FDGKEKT AYRALKQE AEAVAAYLQN IGVKFG DT VA 

I M I I I I M I I I : I I I I M I I I I I I I I t I ft I M I M I I I I II I M I I i I I I I I I I It I I 
al64 MNRTYANFYEMLTAACRKNGNGTAVFDGKEKTAYRALKQEAEAVAAYLQNIGVKFGDTVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 64 .pep LAVSNSTEFITAYFAISAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSKELAGL 
I I I I I I I I I I I M I I : I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I 1 I I 
al64 LAVSNSTEFITAYFAVSAIGAVAVPMNTFLKNSEYAYILNDCKARFLFASAGLSKELAGL 

70 80. 90 100 110 120 

130 * 140 150 160 170 180 

ml 64 .pep KAQTPVEKI IWTDKSRPTGETAEGDAFFEDVRRFPEKPDLGRQPRINDLAHI I YTSGTTG 

1 I I I I I I 11 I I I : I I I II I I I I I I I I I M M I I I I I I I I I I It I I I I I I I I I I I II I 
al64 KAQTPVEKI IWTGQSRPDGEMAEGDAFFEDVRRFPEKPDLGRQPRINDLAHI I YTSGTTG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml64 . pep HPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMACSIILV 
I I I I I I I I I M I I 1 I I I I I I M I II I I I I I I I II I I I I I I I I I I I I I I I II I M I I I I I I 
al64 HPKGALISYANLFANLNGIERIFKISKRDRFIVFLPMFHSFTLTAMVLLPIYMACSIILV 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 64 . pep KSVFPFSNVLKQTLLKRATVFLGVPAIYTAMSKAKIPWYFRWFNRIRLFISGGAPLAEQT 
I I I I I I I I I I I I : II I I I I I I I I I I M M I I I I : I I I I I I I I I I I I I I II I I 1 I I I I I I I 
al64 KSVFPFSNVLKQALLKRATVFLGVPAIYTAMSKTKIPWYFRWFNRIRLFISGGAPLAEQT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 64 . pep ILDFKAKFPRAKLLEGYGLSEAS PWAVNT PERQKARSVGIPLPGLEAKAVDEELVEVPR 

I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I! II I I I : I I I I I I I I I I I I 
al64 ILDFKAKFPRAKLLEGYGLSEAS PWAVNTPERQKARSVGIPLPGLEVKAVDEELVEVPR 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 64 . pep GEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKDLIISK 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I 
al64 GEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKDLIISK 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 64 . pep GQN VY PRE I EEE I YKLDAVE AAAVI GVKDRYADEE I VAFVQLKEGMDLGENE I RRH LRT V 

I f I I I I I M I I M I It I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II 
al64 GQN VYPRE I EEE I YKLDAVE AAAV I GVKDRYADEE I VAFVQLKEGMDLGENE IRRHLRTV 

430 440 450 460 470 480 

490 500 510 

ml 64 . pep LANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFDGNKX 
I I I I I I I I I I I I I I I I I I I I 11 I I I I 11 I I I I I I I I I I 
a 1 6 4 LAN FKI PKQI H FKDGL PRN ATGKVLKRVLKEQFDGNKX 

490 500 510 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 685>: 

gl65 . seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 
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51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAagat gTGGCGTTGG AATCGTCAAA cGCGTGGAAC 

151 AACGcCGgca CGGGGCATTC CGcGCTGTGc GAATTGAACT AtgcgccGCT 

201 GGGtgcggac ggcgtcatCA ATCCGGCGCg cgCCCTGAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGcga cgctgGTCGC GGAAGGCAAG 

301 TTGGAagaCA ATTCCTTCAT CAATGCcgtg ccgcatatGT Ctttggtgat 

351 gAacgaagac cactgCCgtt acCTGCAAAA ACGCTATGAT GTGTTTAAAA 

4 01 CGCAGAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CtccgCTGAT TATGCGCGGC CGGgacgaaA ACCAACCCGT 

501 CGCCGCCAAC TATTCCGCCG Aaggcacgga tgtcgATTTC GGACGGCTGA 

551 CGCGCCAGAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCAGACTG GCAGCTCACC CTCCGCACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CACTGACCCT GCTGCAAAAA 

7 51 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTACCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTAGACGG CAAACGACAC CTTATGTTCG GTCCTTACGC 

951 AGGTTTCCGT TCCAACTTCC TCAAGCAAGG CTCGTTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CcTGCTGGgC gAaTTGCgtt aa 



This corresponds to the amino acid sequence <SEQ ID 686; ORF 165.ng>: 

gl65 . pep 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALE S SNAWN 

51 NAGTGHSALC ELNYAPLGAD GVINPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCRYLQKRYD VFKTQKLFEN MEFSTDRNKI 

151 SDWAPLIMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDWQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GLPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSFM DLPLSIHMDN LYPMLRAGWA 

3 51 NMPLTKYLLG ELR* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 687>: 

ml65.seq (partial) 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGCTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

4 01 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 
4 51 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 
501 CGCCGCCAAC TACTCCGCCG AAGgTACGGA TGTCGATTTC GGACGGCTGA 
551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 
601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 
651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 
701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 
751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTCCCCG TGTCCGGCCT 
801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 
851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 
901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 
951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCGCTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGTGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAA . . . 



This corresponds to the amino acid sequence <SEQ ID 688; ORF 165>: 

ml 65. pep (partial) 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 
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2 01 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 
251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 
301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLCAGWA 
351 NMPLTK. . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ml65 / gl65 97.2% identity in 356 aa overlap 

10 20 30 40 50 60 

ml 65. pep MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I I I I I 1 I I I I I 1 I I I I I I I 1 If I I I I I I I I I I I I I ! I I I M I I I I I I I i I I I M ! I I I M 
gl65 MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 • 90 100 110 120 

ml 65. pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

I M I II I I I : I : I : I I I I I I I I I I M I I I I I M I I i I I M I I I ! I I I I I I It I I II I I I I 
gl65 ELNYAPLGADGVINPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml65 . pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 

M I I I I I I I : I I I I! I I II I If I I I I I I I I I M M : I I I I I I I I I I I I I ! I I I I II I I I J 
g 165 HCRYLQKRYDVFKTQKLFENME FSTDRNKISDWAPLIMRGRDENQPVAANYSAEGTDVDF 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 65 . pep GRLTRQMVKYLQGKGVKTEFNRHVED I KRESDGAWVLKTADTRNPDGQLT LRTRFLFLGA 

II I I It I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I II I II I I I I I I I I I I 
g 165 GRLTRQMVKYLQGKGVKTEFNRHVED I KRESDGAWVLKTADTRNPDWQLT LRTRFLFLGA 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml65 . pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
M I I I I I I I It 1 I I I II I I 1 I : I I I I I I I I I I I I I t I I I I I I I I I I I I I M I I I I I I I I I 
g 165 GGGALTLLQKSGIPEGKGYGGLPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 

310 320 330 340 350 

ml 65 . pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTK 
I I I I I I i I I I I I I I I I I II I I I I I I I I I : I I II I M I I I I I I It I I I I M I I I I I 
g 165 DTRNVDGKRHLMFGPYAGFRSNFLKQGSFMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 

gl65 ELRX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 689>: 

al65 . seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATT AT CG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGTTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

4 01 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

4 51 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 
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601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

7 01 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

7 51 TCCGGCATCC CCGAAGGCAA AGGCTACGGT GGCTTTCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCACTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTTC GCCTCCCTGC TGGAATACTA CCCCGAGGCA AACCCCGACG 

1151 ACTGGGAACT CATCACCGCA GGGCAACGCG TTCAAATCAT TAAAAAAGAC 

1201 TCCGAAAAAG GCGGCGTGTT GCAGTTTGGT ACGGAGATTG TCGCACACGC 

1251 CGACGGCTCG CTCGCCGCAT TGCTGGGCGC GTCGCCGGGC GCATCGACCG 

1301 CCGTGCCGCT GATGATCCGG CTGATGCACC AATGCTTCCC CGAACGCACC 

1351 CCGTCTTGGG AAGGCCGTCT GAAAGAGCTG GTACCGGGTT ACGGCATCAA 

1401 GTTGAACGAA AACCCCGAAA GGGCGGATGA AATTATCGCC TATACCGCGA 

1451 AAGTGTTGGA TATTTAA 



This corresponds to the amino acid sequence <SEQ ID 690; ORF 165.a>: 

al65 .pep 

1 MAE AT D WLV GGGIMSATLG VLL KELEFSW EITLIERLED VALE S SN AWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

4 01 SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERT 

451 PSWEGRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI* 



ml65/al65 99.7% identity in 356 aa overlap 

10 20 30 40 50 60 

ml 65 . pep MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I I It I I I M I I I I I II I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I II I M I I I I 
al65 MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 65 . pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
I It I II I I I I I I I I I I I I I I I I I I I I I I I II I I II II i I I I 1 I I I I I I I I I I I I I M I I I 
al65 ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 65. pep HC S YLQKR YD AFKTQKLFENMEFSTDRNKISD WAP LMMRGRDENQPVAAN YSAEGTDVDF 

II I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I II I II I I I I I I I I M I I I I I I I I 
al65 HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAP LMMRGRDENQPVAAN YSAEGTDVDF 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 65 . pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLT LRTRFLFLGA 

I I I I I I I I I I I I i I I I M I I I I I I I I I I i M I I I I I I I I M I If I I I I I I I I I I I ! I II I 
al65 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 65. pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
II I I I I I M I I I I I I I I I 1 I I I J I I II I I I I I I I I I I I I I I II I I I I I I I I N I I I I M I 
al65 GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 
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310 320 330 340 350 

ml 65. pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTK 
I I I! II I I I I I I ! I I I I t I I I I I I I I I I I I I M I I ! I i I I M I I I I I I I M I I I I 
a!65 DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 

al 65 ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQI IKKDSEKGGVLQFGTEIVAHADGS 

370 380 390 400 410 420 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 69 1>: 

gl65-l.seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAagat gTGGCGTTGG AATCGTCAAA cGCGTGGAAC 

151 AACGcCGgca CGGGGCATTC CGcGCTGTGc GAATTGAACT AtgcgccGCT 

201 GGGtgcggac ggcgtcatCA ATCCGGCGCg cgCCCTGAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGcga cgctggTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAACGAAGAC CACTGCCGTT ACCTGCAAAA ACGCTATGAT GTGTTTAAAA 

401 CGCAGAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CtccgCTGAT TATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TATTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAGAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCAGACTG GCAGCTCACC CTCCGCACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CACTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTACCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTAGACGG CAAACGACAC CTTATGTTCG GTCCTTACGC 

951 AGGTTTCCGT TCCAACTTCC TCAAGCAAGG CTCGTTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTtt gCCTCCCTGC TGgaatacta cccGaggcag acccGACGAc 

1151 tggtactcat cacgcaggnc acGCGTcata tcattanata tgactCgaaa 

1201 ctgcgcgtgc tgcagttgta cgagattgtg ccaCGCGacg ctcgctcgcg 

1251 cattctggag cgtcgcggcg catcacgctn tgcgctgata tccgctgatg 

1301 acactgctcc gaGCGcgccc gtcttggaaa gtgtctga 



This corresponds to the amino acid sequence <SEQ ID 692; ORF 165-l.ng>: 

gl65-l.pep 

1 MAEAT DWLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAD GVINPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCRYLQKRYD VFKTQKLFEN MEFSTDRNKI 

151 SDWAPLIMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDWQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GLPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSFM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPRQ TRRLVLITQX TRHIIXYDSK 

401 LRVLQLYEIV PRDARSRILE RRGASRXALI SADDTAPSAP VLESV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 693>: 

ml 65-1. seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGCTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGTACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTCCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 
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851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCGCTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGTGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTTC GCCTCCCTGC TGGAATACTA CCCCGAGGCA AACCCCGACG 

1151 ACTGGGAACT CATCACCGCA GGGCAACGCG TCCAAATCAT TAAAAAAGAC 

1201 TCCGAAAAAG GCGGCGTGCT CCAGTTTGGT ACGGAGATTG TCGCCCACGC 

1251 CGACGGCTCA CTCGCCGCAT TGCTGGGCGC GTCGCCGGGC GCATCGACCG 

1301 CTGTGCCGCT GATGATCCGG CTGATGCACC AATGCTTCCC CGAGCGCGCC 

1351 CCGTCTTGGG AAGACCGTCT GAAAGAGCTG GTACCGGGTT ACGGCATCAA 

1401 GTTGAACGAA AACCCTGAAA GGGCGGATGA AATTATCGCC TATACCGCGA 

1451 AAGTATTGGA TATTTAA 

This corresponds to the amino acid sequence <SEQ ID 694; ORF 165-1>: 

ml 65-1. pep 

1 MAEATD VVLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLCAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

401 SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERA 

451 PSWEDRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI * 



ml65-l/gl65-l 89.7% identity in 428 aa overlap 



10 20 30 40 50 60 

ml65-l pep MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

1 1 1 1 1 1 1 1 1 1 1 i i i i i i i t i i i i i t i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i t i i i i 

ql65-l MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml65-l pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
{ M | | M | I : I : I : I I I I f I I ! i ! I I I I 1 I I I I 1 i ! I I I I t t I t I I ! I I I i I t I I I I I I I 
Q165-1 ELNYAPLGADGVINPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml65-l pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAAN YSAEGTDVDF 
|| | | | I | I | : I I I I I M I I I I 1 M I I I I I I I I t t I : I i I I I I M I M I I I I ) I I I I 1 I I 
gl65-l HCRYLQKRYDVFKTQKLFENMEFSTDRNKISDWAPLIMRGRDENQPVAAN YSAEGTDVDF 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml65-l pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
I | | | | | | i I I I I 1 1 1 I I I I H 1 t I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I 
Q165-1 G RLT RQM VK Y LQGKG VKT E FN RH VE D I KRESDGAWVLKTADTRNPDWQLT LRTRFLFLGA 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml65-l.pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

| | | | | | I | | | | | | I II I I II I : I M t I I I I I I I I I I I I I I ! I I I I I 1 I I I I I I I I I I M I 
gl65-l GGGALTLLQKSGIPEGKGYGGLPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
250 260 270 280 290 300 

310 320 330 340 350 360 

ml65-l pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTKYLLG 
| | | U | | | | | | I | I I I I I I I I I I I I I I I : I I I I I t I i I I t I I i I I I I I I I I I I I i I I I i 
gl65-l DTRNVDGKRHLMFGPYAGFRSNFLKQGSFMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml65-l.pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 

I I I I I I I I I I I t t t I I I I : • III I :lt It I III: : I ' 

gl65-l ELRKTKEERFASLLEYYPR-QTRRLVLITQXTR-HIIXYDS-KLRVLQLYEIVPRDARSR 
370 380 390 400 410 



430 440 450 460 470 480 
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ml 65-1. pep LAALLGASPGASTAVPLMIRLMHQCFPERAPSWEDRLKELVPGYGIKLNENPERADEI I A 
: II I 

gl 65- 1 ILERRGASRXALISADDTAPSAPVLESVX 
420 430 440 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 695>: 

al65-X.seq 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGTTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGGCACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGT GGCTTTCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCACTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTTC GCCTCCCTGC TGGAATACTA CCCCGAGGCA AACCCCGACG 

1151 ACTGGGAACT CATCACCGCA GGGCAACGCG TTCAAATCAT TAAAAAAGAC 

1201 TCCGAAAAAG GCGGCGTGTT GCAGTTTGGT ACGGAGATTG TCGCACACGC 

1251 CGACGGCTCG CTCGCCGCAT TGCTGGGCGC GTCGCCGGGC GCATCGACCG 

1301 CCGTGCCGCT GATGATCCGG CTGATGCACC AATGCTTCCC CGAACGCACC 

1351 CCGTCTTGGG AAGGCCGTCT GAAAGAGCTG GTACCGGGTT ACGGCATCAA 

14 01 GTTGAACGAA AACCCCGAAA GGGCGGATGA AATTATCGCC TATACCGCGA 

14 51 AAGTGTTGGA TATTTAA 

This corresponds to the amino acid sequence <SEQ ED 696; ORF 165-l.a>: 

al65-l .pep 

1 MAEAT DWLV GGGIMSATLG VLL KELEPSW EITLIERLED VALESSNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

401 SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERT 

451 PSWEGRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI* 

al65-l/m!65-l 99.4% identity in 488 aa overlap 

10 20 30 40 50 60 

MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I 1 I I 1 M I I M I I I I I I I I M I I I 
MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
10 20 30 40 50 60 

70 80 90 100 110 120 

ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
| I I I I I I I I I I I I I I I I II I I 1 I I I I I I I I i I I I I ! I I I I I I I I I I II I I I I I II I I I I I 
ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
70 80 90 100 110 120 

130 140 150 160 170 180 

HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDF 

I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I 1 I I I I I I I I I II t I 

HCSYLQKRYDAFKTQKLFENMEFSTDRNKI SDWAPLMMRG RDENQPVAAN YSAEGTDVDF 
130 140 150 160 170 180 



al65-l -pep 
ml65-l 

al65-l .pep 
ml65-l 

al65-l -pep 
ml65-l 



190 200 210 220 230 240 
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al65-l - pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
I i I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I 
ml 65-1 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
190 200 210 220 230 240 

250 260 270 280 290 300 

al65-l .pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I t I t I I I 1 I I i I I I I I I I I I M I I I I I 
ml65-l GGGALTLLQKSG I PEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQAS VGAPPMSVPHL 

250 260 270 280 290 300 



310 320 330 340 350 360 

al65-l.pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 
I | I M I I I f I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I It I I I I M I 1 I I I I I I I I 
ml 65-1 DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTKYLLG 

310 320 330 340 350 360 

370 380 390 400 410 420 

a 165-1 .pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 
M I 1 I I ! t I I I I I I 1 I I I I I I I I I I I i I I I I I I I I I I I II I i I I I I I I I I I I I I I I I I I I 
ml 65 - 1 ELRKTKEERFASLLEY YPEANPDDWELITAGQRVQI I KKDSEKGGVLQFGTEI VAHADGS 

370 380 390 400 410 420 



430 440 450 460 470 480 

a 165-1 .pep LAALLGASPGASTAVPLMIRLMHQCFPERTPSWEGRLKELVPGYGIKLNENPERADEIIA 
I I I I I I I I I t I t ! I I I I I I M I I I I I I I I : I I I I I i 1 I I I I I I i t i I I I I I I I I I I I I I 
ml 65-1 LAALLGASPGASTAVPLMIRLMHQCFPERAPSWEDRLKELVPGYGIKLNENPERADEIIA 

430 440 450 460 470 480 



489 

a!65-l.pep YTAKVLDIX 
I I I I I I I I i 
ml 65-1 YTAKVLDIX 

a!65-l/p33940 

sp|P33940|YOJH_ECOLI HYPOTHETICAL 60.2 KD PROTEIN IN ECO-ALKB INTERGENIC REGION 
>gi I 1736851 lgnl | PID| dl016718 (D90850) ORF_ID : o372#5 ; similar to [SwissProt Accession Number 
P33940] [Escherichia coli] >gi| 1788539 (AE000310) f548; This 548 aa ORF is 100 pet identical 
to 490 residues of YOJH_ECOLI SW: P3394 0 (4 92 aa) but contains 5 6 additional N-ter aa; 100 pet 
identical to GB: ECOHU49_33 

ACCESSION: U00008 (490 aa) but contains 58 aditional N-term resi... Length = 548 
Score = 458 bits (1167), Expect = e-128 

Identities = 233/490 (47%), Positives = 303/490 (61%), Gaps = 5/490 (1%) 



Query: 3 EATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALCEL 62 

+ TDV+L+GGGIMSATLG L+ELEP W +T++ERLE VA ESSN WNNAGTGHSAL EL 
Sbjct: 30 QETDVLLIGGGIMSATLGTYLRELEPEWSMTMVERLEGVAQESSNGWNNAGTGHSALMEL 89 

Query: 63 NYAPLGANGI I DPARALNIAEQFHVSRQFWATLVAEGKLED-NS FINAVPHMSLVMNEDH 121 

NY P A+G I +A+ I E F +SRQFWA V G L SFIN VPHMS V ED+ 
Sbjct: 90 NYTPQNADGSISIEKAVAINEAFQISRQFWAHQVERGVLRTPRSFINTVPHMSFVWGEDN 149 

Query: 122 CSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDFG 181 

++L+ RY A + LF M +S D +1 +WAPL+M GRD Q VAA + GTDV++G 
Sbjct: 150 VNFLRARYAALQQSSLFRGMRYSEDHAQIKEWAPLVMEGRDPQQKVAATRTEIGTDVNYG 209 

Query: 182 RLTRQMVKYLQGKG-VKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTXXXXXXXXXX 240 

+TRQ++ LQ K + + V +KR D W + AD +N Q 

Sbjct: 210 EITRQLIASLQKKSNFSLQLSSEVRALKRNDDNTWTVTVADLKNGTAQ-NIRAKFVFIGA 268 

Query: 241 XXXXXXXXQKSG I PEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQAS VGAPPMSVPHL 300 

Q+SGIPE K Y GFPV G F + NP+ H AKVYG+ASVGAPPMSVPH+ 
Sbjct: 269 GGAALKLLQESGIPEAKDYAGFPVGGQFLVSENPDVVNHHLAKVYGKASVGAPPMSVPHI 328 

Query: 301 DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 360 

DTR +DGKR ++FGP+A F + FLK GSL DL S N+ PM+ G N L KYL+ 
Sbjct: 329 DTRVLDGKRVVLFGPFATFSTKFLKNGSLWDLMSSTTTSNVMPMMHVGLDNFDLVKYLVS 388 

Query: 361 ELRKTKEERFASLLEY YPEANPDDWELITAGQRVQI I KKDSEKGGVLQFGTE I VXXXXXX 420 

++ ++E+RF +L EYYP+A +DW L AGQRVQI IK+D+EKGGVL+ GTE+V 
Sbjct: 389 QVMLSEEDRFEALKEYYPQAKKEDWRLWQAGQRVQIIKRDAEKGGVLRLGTEVVSDQQGT 448 



Query: 421 XXXXXXXXXXXXXXVPLMIRLMHQCFPER — TPSWEGRLKELVPGYGIKLNENPERADEI 478 
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P+M+ L+ + F +R +P W+ LK +VP YG KLN + + 
Sbjct: 44 9 IAALLGASPGASTAAPIMLNLLEKVFGDRVSSPQWQATLKAIVPSYGRKLNGDVAATERE 508 

Query: 47 9 IAYTAKVLDI 488 

+ YT++VL + 
Sbjct: 509 LQYTSEVLGL 518 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 697>: 

g204 . seq 

1 atggcggcgg cggaaataaa acgccccctc gctgtcgatt tccagcacat 

51 agcgtccgtt ctgcacggcg gcatagccgc ttttgcctgc ctgatagggt 

101 tgcagggcgg aatgcgaaat caggtaatca gtcagtttgc cgccgtcttc 

151 ggcgatattg cccaccagtt tggcaaacaa ggtatggcac acgccgtttt 

201 ccgcccagcc cgaaggcgcg tcctttccgt cggtttccat acatttgccg 

251 acgacggctt ccaagtcgtt gggatgcttt ccggtcagcc ggacggcgtt 

301 ttgttccggc aagcctttaa tcggataact gatttgtttt ttgccgtcgt 

3 51 tggttttgcc ttcgctactt tgtcccaaag ccaaaccggc aatcgccgta 

401 ttgtcgatgt atttgacttt gaaaaccggt ttcggcgcgc tttgtgccgc 

451 attttgcggc tgttccgccg tattttcgga tttgccgcag gcggcaagca 

501 gcaggcagcc gcccaacacg gcaaaaggta ttttcagcat tccgcactcc 

551 tgatggtttc aaaatgccgt ctgaaatgcc gtctgaaacg tggcaggcgg 

601 aggttcggac ggcattgggt ttatttcaac gggcggatgc cgaccgcatc 

651 gcgtacttta tccaacaatt cgcgcgcttc tttgcgcgct ttttgcgcgc 

701 ctgcctgcaa aatctcttcg atttgcgaag gattagaggt caatgcgttg 

751 tag 

This corresponds to the amino acid sequence <SEQ ID 698; ORF 204.ng>: 

g204 -pep 

1 MAAAEIKRPL AVDFQ HIASV LHGGIAAFAC LIG LQGGMRN QVI SQFAAVF 

51 GDIAHQFGKQ GMAHAVFRPA RRRVLSVGFH TFADDGFQW GMLSGQPDGV 

101 LFRQAFN RIT DLFFAWGFA FATL SQSQTG NRRIVDVFDF ENRFRRALCR 

151 ILRLFRRIFG FAAGGKQQAA AQHGKRYFQH SALLMVSKCR LKCRLKRGRR 

201 RFGRHWVYFN GRMPTASRTL SNNSRASLRA FCAPACKISS ICEGLEVNAL 

251 * 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 699>: 



m204 . seq 




1 


ATGGCGGCGG 


51 


AGCGTCCGTT 


101 


TGCAGGGCGG 


151 


GGCGATATTG 


201 


CTGCCCAACC 


251 


CTGACGGCTT 


301 


TTGTTCCGGC 


351 


TGGTTTTGCC 


401 


TTGTCGATAT 


451 


GTTTTGCGGC 


501 


CAGGCAGCCG 


551 


GATGGTTTCA 


601 


ATCGGGTTCA 


651 


ATAATTCGCG 


701 


TCTTCGATTT 



This corresponds to the amino acid sequence <SEQ ID 700; ORF 204>: 

m2 04.pep 

1 MAAAEIKRPF AVDFQ HIASV LHGGIAAFAC LIG LQGGMRN *VIRQFAAVF 

51 GDIAHQFGKQ GMAHAVFC PT CRTVLIIGFH TFAADGFQVA GMLADQSDNI 

101 LFRQAFNR IT DLFFAWGFA FAAL SQIQTG NRRIVDIYDF ENRFRRALYR 

151 VLRLYRRIXG FAATAXQQAA AQYGKXXXQH STXLMVSKCR LKRGRRRFGR 

201 HRVHFNGRMP TASGTLSNNS RAS LRAFAAP ACKISSICEG SAVSSL* 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 204 shows 82.0% identity over a 250 aa overlap with a predicted ORF (ORF 204.ng) 
from N. gonorrhoeae: 

m204/g204 

10 20 30 40 50 60 

m204 pep MAAAEI KRPFAVDFQHI ASVLHGGIAAFACLIGLQGGMRNXVI RQFAAVFGDIAHQFGKQ 

1 1 1 1 1 1 i i 1 = 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 

MAAAE I KRPLAVDFQH I AS VLHGG I AAFACL I GLQGGMRNQV I SQ FAAVFGD I AHQ FGKQ 
10 20 30 40 50 60 



g204 



70 80 90 100 110 120 

m204 pep GMAHAVFCPTCRTVLI IGFHTFAADGFQVAGMLADQSDNT LFRQAFNRITDLFFAWGFA 

I [ 1 1 1 1 1 I I II ■■WWW llllhllh I hMIMMMIMMMIMM 

g2 04 GMAHAVFRPARRRVLSVGFHTFADDGFQWGMLSGQPDGVLFRQAFNRITDLFFAWGFA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 04 pep FAALSQ I QTGNRRI VD I YDFENRFRRALYRVLRLYRRIXGFAATAXQQAAAQYGKXXXQH 

||:||| I Ml , I I h--l I M I I : I M hllhlll I I I I : IIIMhll II 

g2 04 FATLSQSQTGNRRIVDVFDFENRFRRALCRILRLFRRIFGFAAGGKQQAAAQHGKRYFQH 

130 140 150 160 170 180 

190 200 210 220 230 

m204 pep STXLMVSKCRLK RGRRRFGRHRVH FNGRM P T AS GTL SNNS RAS LRAF AAP AC KI S S 

I 1 1 1 1 1 1 i 1 1 MIMMII hllllMIII MIIMIIIMM MINIM 

g2 0 4 SALLMVSKCRLKCRLKRGRRRFGRHWVYFNGRMPTASRTLSNNSRASLRAFCAPACKI SS 

190 200 210 220 230 240 

240 

m204.pep ICEGSAVSSLX 

I II I MM 
g2 04 I CEGLEVNAL 

250 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 701 >: 

a204 . seq 

1 ATGGCGGCGG CGGAAATAAA ACGCCCCCTC GCTGTCGATT TCCAGCACAT 

51 AGCGTCCGTT CTGCACGGCG GCATAGCCGC TTTTGCCTGC CTGATAGGGT 

101 TGCAGGGCGG AATGCGAAAT CAGGTAATCC GTCAGTTTGC CGCCGTCTTC 

151 GGCGATATTG CCCACCAGTT TGGCAAACAA GGTATGGCAC ACGCCGTTTG 

201 CCGCCCAGCC CGAAGGCGCG CCCTTTCCGT CGGTTTCCAT ACATTTGCCG 

251 ACGACGGCTT CCAAGTCGTT GGGATGCTTG CCGGTCAGCC GGACGACGTT 

301 TTGTTCCGGC AAGCCTTT 

351 

401 

451 

501 

55! AAGAG GTTCGGACGG 

601 CATTGGGTTT ATTTCAACGG GCGGATACCG ACCGCATCAC GTACTTTGCC 

651 CAATAATTCG CGTGCTTCTT TACGCGCTTT TTGCGCGCCT GCCTGCAAAA 

7 01 TCTCTTCGAT TTGCGAAGGG TCGGCGGTCA GCTCGTTGTA G 

This corresponds to the amino acid sequence <SEQ ID 702; ORF 204.a>: 

a204 .pep 

1 MAAAE IKRPL AVDFQH IASV LHGGIAAFAC LIG LQGGMRN QVIRQFAAVF 

51 GDIAHQFGKQ GMAHAVCRPA RRRALSVGFH TFADDGFQW GMLAGQPDDV 

101 LFRQAF 

151 KRFGR 

201 HWVYFNGRIP TASRTLPNNS RASLRAFCAP ACKISSICEG SAVSSL* 
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m204/a204 54.5% identity in 246 aa overlap 

10 20 30 40 50 60 

m204 .pep MAAAEIKRPFAVDFQHIASVLHGGIAAFACLIGLQGGMRNXVIRQFAAVFGDIAHQFGKQ 
I I I I I I I I I : 1 ! \ I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I II I I t 1 I I I I 
a20 4 MAAAEIKRPLAVDFQHIASVLHGGIAAFACLIGLQGGMRNQVIRQFAAVFGDIAHQFGKQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

m204 . pep GMAH A V FCPTCRTVLIIGFHT F AA DG FQ V AGM L ADQS DN I L F RQ A FN R I T DL FFA WG FA 

I I I I I I I : ! : t : I I I I I I I I I I I : I I I I I I : : I I I ! I I 
a20 4 GMAHAVCRPARRRALSVGFHTFADDGFQVVGMLAGQPDDVLFRQAF 

70 80 90 100 

130 140 150 160 170 180 

m2 0 4 . pep FAALSQIQTGNRRI VDI YDFENRFRRALYRVLRLYRRIXGFAATAXQQAAAQYGKXXXQH 

a204 



190 200 210 220 230 240 

m204 .pep STXLMVSKCRLKRGRRRFGRHRVHFNGRMPTASGTLSNNSRASLRAFAAPACKISSICEG 

: I I I I I I : I I I I : I I I I II II II I I 1 1 II I I I I I I I I I I I I 

a204 KRFGRHW VY FNGRI PTASRTLPNNSRASLRAFCAPACKI SS ICEG 

110 120 130 140 150 



m204.pep SAVSSLX 
I I I I I I I 

a20 4 SAVSSLX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 703>: 

g205 . seq 

1 atgctgaaaa taccttttgc cgtgttgggc ggctgcctgc tgcttgccgc 

51 ctgcggcaaa tccgaaaata cggcggaaca gccgcaaaat gcggcacaaa 

101 gcgcgccgaa accggttttc aaagtcaaat acatcgacaa tacggcgatt 

151 gccggtttgg ctttgggaca aagtagcgaa ggcaaaacca acgacggcaa 

201 aaaacaaatc agttatccga ttaaaggctt gccggaacaa aacgccgtcc 

2 51 ggctgaccgg aaagcatccc aacgacttgg aagccgtcgt cggcaaatgt 

3 01 atggaaaccg acggaaagga cgcgccttcg ggctgggcgg aaaacggcgt 
3 51 gtgccatacc ttgtttgcca aactggtggg caatatcgcc gaagacggcg 
401 gcaaactgac tgattacctg atttcgcatt ccgccctgca accctatcag 
451 gcaggcaaaa gcggctatgc cgccgtgcag aacggacgct atgtgctgga 
501 aatcgacagc gagggggcgt tttatttccg ccgccgccat tattga 

This corresponds to the amino acid sequence <SEQ ID 704; ORF 205 .ng>: 

g205 .pep 

1 MLKIPFAVLG GCLLLAA CGK SENTAEQPQN AAQSAPKPVF KVKYIDNTAI 

51 AGLALGQSSE GKTNDGKKQI SYPIKGLPEQ NAVRLTGKHP NDLEAWGKC 

101 METDGKDAPS GWAENGVCHT LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ 

151 AGKSGYAAVQ NGRYVLEIDS EGAFYFRRRH Y 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 705>: 

m205 . seq 

1 ATGCTGAAwA CwTyTTTTGC CGTATTGGGC GGCTGCCTGC TGCyTtGCCG 

51 t GCGG CAAAT CCGwAAATAC GGCGGTACAG CCGCAAAACG CGGTACAAAG 

101 CGCGCCGAAA CCGGTTTTCA AAGT CAT AT A TATCGACAAT ACGGCGATTG 

151 CCGGTTTGGA TTTGGGACAA AGCAGCGAAG GCAAAACCAA CGACGGCAAA 

2 01 AAACAAATCA GTTATCCGAT TAAAGGCTTG CCGGAACAAA ATGTTATCCG 
251 ACTGATCGGC AAGCATCCCG GCGACTTGGA AGCCGTCAGC GGCAAATGTA 

3 01 TGGAAACCGA TGATAAGGAC AGTCCGGCAG GTTGGGCAGA AAACGGCGTG 

3 51 TGCCATACCT TGTTTG CCAA ACTGGTGGGC AATATCGCCG AAGACGGCGG 

4 01 CAAACTGACG GATTACCTAG TTTCGCATGC CGCCCTGCAA CCCTATCAGG 
4 51 CAGGCAAAAG CGGCTATGCC GCCGTGCAGA ACGGACG CTA TGTGCTGGAA 
501 ATCGACAGCG AAGGGGCGTT TTATTTCCGC CGCCGCCATT ATTGA 



WO 99/57280 



PCT/US99/09346 



479 



This corresponds to the amino acid sequence <SEQ ID 706; ORF 205>: 

m205 .pep 

1 MLXTXFAVLG GCLLX CRCGK SXNTAVQPQN AVQSAPKPVF KVIYIDNTAI 

51 AGLDLGQSSE GKTNDGKKQI SYPIKGLPEQ NVIRLIGKHP GDLEAVSGKC 

101 METDDKDSPA GWAENGVCHT LFAKLVGNIA EDGGKLTDYL VSHAALQPYQ 

151 AGKSGYAAVQ NGRYVLEIDS EGAFY FRRRH Y* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 205 shows 88.4% identity over a 181 aa overlap with a predicted ORF (ORF 205 .ng) 
from N, gonorrhoeae: 

m205/g205 

10 20 30 40 50 60 

m2 05 . pep MLXTXFAVLGGCLLXCRCGKSXNTAVQPQNAVQSAPKPVFKVIY I DNTAI AGLDLGQSSE 

II lllllllll MM III 1 1 1 1 1 ^ 1 1 1 1 1 1 i I ! i MIMIIIII IMIII 

g205 MLKIPFAVLGGCLLLAACGKSENTAEQPQNAAQSAPKPVFKVKYIDNTAIAGLALGQSSE 

10 20 30 40 50 60 



70 80 90 100 110 120 

m205 . pep GKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKCMETDDKDSPAGWAENGVCHT 

M M II M II M M I M II I h M I MlhIMM IMMM IhhIMMIMM 

g2 0 5 GKTNDGKKQI SYPI KGLPEQNAVRLTGKHPNDLEAWGKCMETDGKDAPSGWAENGVCHT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m205 - pep LFAKLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQNGRYVLEIDSEGAFYFRRRH 

II II II II M 1 1 II M I M I : I h M 1 1 II 1 1 1 M I II II II M II II II M I II 1 1 II I 

g205 LFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQNGRYVLEIDSEGAFYFRRRH 

130 140 150 160 170 180 



m205.pep YX 
I 

g205 Y 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 707>: 

a205.seq (partial) 

1 TCCGAACCTC TTAAAGGCTT GCCGGAACAA AACGTCGTCC GGCTGACCGG 

51 CAAGCATCCC AACGACTTGG AAGCCGTCGT CGGCAAATGT ATGGAAACCG 

101 ACGGAAAGGG CGCGCCTTCG GGCTGGGCGG CAAACGGCGT GTGCCATACC 

151 TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG GCAAACTGAC 

201 GGATTACCTG ATTTCGCATT CCGCCCTGCA ACCCTATCAG GCAGGCAAAA 

251 GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA AATCGACAGC 

301 GAGGGGGCGT TTTATTTCCG CCGCCGCCAT TATTGA 

This corresponds to the amino acid sequence <SEQ ID 708; ORF 205. a>: 

a205.pep (partial) 

1 SEPLKGLPEQ NVVRLTGKHP NDLEAVVGKC METDGKGAPS GWAANGVCHT 

51 LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ AGKSGYAAVQ NGRYVLEIDS 

101 EGAFY FRRRH Y* 

m205/a205 88.3% identity in 1 1 1 aa overlap 

50 60 70 80 90 100 

m205 . pep KVIYIDNTAIAGLDLGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKC 

I I : I I I I I I I I : I I I J I I : I I I ! I IN 
a205 SEPLKGLPEQNWRLTGKHPNDLEAWGKC 

10 20 30 



110 120 130 140 150 160 

m205 . pep METDDKDSPAGWAENGVCHTLFAKLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQ 
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MM I : 1 : I I I II I M I I II I I I I I 1 M I I 1 II II M : I I : I i I I I I I 1 I I I M I I I 
a205 METDGKGAPSGWAANGVCHTLFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQ 

40 50 60 70 80 90 



170 180 
m2 05 . pep NGRYVLEIDSEGAFYFRRRHYX 
M t I I I II I II I I I I I I I I I I I 
a 2 0 5 NGRYVLE I DSEGAFYFRRRHYX 

100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 709>: 

g205-l.seq (partial) 

1 ATGCTGAAAA TAcCTTTTGC CGTGTTGGGC GGCTGCCTGC TGCTTGCCGC 

51 CTGCGGCAAA TCCGAAAATA CGGCGGAACA GCCGCAAAAT GCGGCACAAA 

101 GCGCGCCGAA ACCGGTTTTC AAAGTCAAAT ACATCGACAA TACGGCGATT 

151 GCCGGTTTGG CTTTGGGACA AAGTAGCGAA GGCAAAACCA ACGACGGCAA 

201 AAAACAAATC AGTTATCCGA TTAAAGGCTT GCCGGAACAA AACGCCGTCC 

251 GGCTGACCGG AAAGCATCCC AACGACTTGG AAGCCGTCGT CGGCAAATGT 

301 ATGGAAACCG ACGGAAAGGA CGCGCCTTCG GGCTGGGCGG AAAACGGCGT 

351 GTGCCATACC TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG 

4 01 GCAAACTGAC TGATTACCTG ATTTCGCATT CCGCCCTGCA ACCCTATCAG 

451 GCAGGCAAAA GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA 

501 AATCGACAGC GAGGGGGCGT TTTA 



This corresponds to the amino acid sequence <SEQ ID 710; ORF 205-l.ng>: 

g205-l.pep (partial) . 

1 MLKIPFAVLG GCLLLAA CGK SENTAEQPQN AAQSAPKPVF KVKYIDNTAI 
51 AGLALGQSSE GKTNDGKKQI SYPIKGLPEQ NAVRLTGKHP NDLEAWGKC 
101 METDGKDAPS GWAENGVCHT LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ 
151 AQKSG YAA VQ NGRYVLEIDS EGAF 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 71 1>: 

xn205*l. seq. - 

1 ATGCTGAAAA CATCTTTTGC CGTATTGGGC GGCTGCCTGC TGCTTGCCGC 

51 CTGCGGCAAA TCCGAAAATA CGGCGGAACA GCCGCAAAAC GCGGTACAAA 

101 GCGCGCCGAA ACCGGTTTTC AAAGTCAAAT ATATCGACAA TACGGCGATT 

151 GCCGGTTTGG ATTTGGGACA AAGCAGCGAA GGCAAAACCA ACGACGGCAA 

201 AAAACAAATC AGTTATCCGA TTAAAGGCTT GCCGGAACAA AATGTTATCC 

251 GACTGATCGG CAAGCATCCC GGCGACTTGG AAGCCGTCAG CGGCAAATGT 

301 ATGGAAACCG ATGATAAGGA CAGTCCGGCA GGTTGGGCAG AAAACGGCGT 

351 GTGCCATACC TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG 

401 GCAAACTGAC GGATTACCTA GTTTCGCATG CCGCCCTGCA ACCCTATCAG 

451 GCAGGCAAAA GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA 

501 AATCGACAGC GAAGGGGCGT TTTATTTCCG CCGCCGCCAT TATTGA 



This corresponds to the amino acid sequence <SEQ ID 712; ORF 205- 1>: 

m205-l .pep 

1 MLKTSFAVLG GCLLLAA CGK SENTAEQPQN AVQSAPKPVF KVKYIDNTAI 
51 AGLDLGQSSE GKTNDGKKQI SYPIKGLPEQ NVIRLIGKHP GDLEAVSGKC 
101 METDDKDSPA GWAENGVCHT LFAKLVGNIA EDGGKLTDYL VSHAALQPYQ 
151 AGKSGYAAVQ NGRYVLE IDS EGAFYFRRRH Y* 



m205-l/g205-l 92,0% identity in 174 aa overlap 

10 20 30 40 50 60 

g205 - 1 . pep MLKI PFAVLGGCLLLAACGKSENTAEQPQNAAQSAPKPVFKVKYIDNTAIAGLALGQSSE 

III 1 I I I 1 I I I 1 1 I I I t I I I I I 1 1 I I I 1 I ^ i I I I I 1 I 1 1 I I f I 1 I I I I I I I 1 I I I I I 
m2 05 - 1 MLKTSFAVLGGCLLLAACGKSENTAEQPQNAVQSAPKPVFKVKYIDNTAIAGLDLGQSSE 
10 20 30 40 50 60 



70 80 90 100 110 120 

g205 - 1 . pep GKTNDGKKQI SYPIKGLPEQNAVRLTGKHPNDLEAWGKCMETDGKDAPSGWAENGVCHT 

I ! II I M I IIIIMil I lill-M lllhimi lllllll Ih I Ml II 1 1 1 1 II 

m2 0 5 - 1 GKTNDGKKQ I S YPI KGLPEQNVI RLI GKH PGD LEAVSG KCMETDDKDS PAGWAENGVCHT 
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70 80 90 100 110 120 

130 140 150 160 170 

g205-l.pep LFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQNGRYVLEIDSEGAP 

M I II MM II MIMM I i MMMM! I I I I II Ml IMIIIII MM MM- 

tn2 0 5 - 1 LFAKLVGNI AEDGGKLTD YLVSHAALQPYQAGKSGYAAVQNGR YVLE IDSEGAF YFRRRH 

130 140 150 160 170 180 

m205-l YX 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 71 3>: 

a205-l.seq (partial) 

1 CCTCTTAAAG GCTTGCCGGA AC AAAACGT C GTCCGGCTGA CCGGCAAGCA 

51 TCCCAACGAC TTGGAAG CCG TCGTCGGCAA ATGTATGGAA ACCGACGGAA 

101 AGGGCGCGCC TTCGGGCTGG GCGGCAAACG GCGTGTGCCA TACCTTGTTT 

151 GCCAAACTGG TGGGCAATAT CGCCGAAGAC GGCGGCAAAC TGACGGATTA 

201 CCTGATTTCG CATTCCGCCC TG CAACCCT A TCAGGCAGGC AAAAGCGGCT 

251 ATGCCGCCGT GCAGAACGGA CGCTATGTGC TGGAAATCGA CAGCGAGGGG 

301 GCGTTTTATT TCCGCCGCCG CCATTATTGA 

This corresponds to the amino acid sequence <SEQ ID 714; ORF 205-l.a>: 

a205-l.pep (partial) 

1 PLKGLPEQNV VRLTGKHPND LEAWGKCME TDGKGAPSGW AAWGVCHTI/F 
51 AKLVGNIAED GGKLTDYLIS HSALQPYQAG KSGYAAVQNG RYVLEIDSEG 
101 AFYFRRRHY* 

m205»l/a205-l 89.0% identity in 109 aa overlap 

50 60 70 80 90 100 

m205 -1 . pep KYIDNTAIAGLDLGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKCME 

i : 1 1 1 1 ! ! 1 1 : I ! MIMMIIi Mil! 

a 2 0 5 - 1 PLKGIiPEQNWRLTGKHPNDLEAWGKCME 

10 20 30 

110 12 0 13 0 14 0 150 160 

m2 0 5 - 1 . pep TDDKDSPAGWAENGVCHTLFAKLVGNI AEDGGKLTDYLVSHAALQPYQAGKSGYAAVQNG 

II I :hlll M IMIMMI II SMI M I i I 1 IIIIMIM II MI1IMI MUM I 

a2 0 5 - 1 TDGKGAPSGWAANGVCHTLFAKLVGNI AEDGGKLTDYLI SHSALQPYQAGKSGYAAVQNG 

40 50 60 70 80 90 

170 180 
m2 0 5 - 1 . pep RYVLEIDSEGAFYFRRRHYX 

Ml 1IIIM Ml II I1MII 

a2 0 5 - 1 RYVLEIDSEGAFYFRRRHYX 
100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 715>: 

g206 . seq 

1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggactca tcggcacgcc 

201 ctacaaatgg ggcggcagca gcaccgcaac cggcttcgac tgcagcggca 

251 tgattcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

301 gcccgcgaca tggcggcggc aagccgcaaa atccccgaca gccgcctcaa 

3 51 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctactcac 

401 acgtcggact ctacatcggc aacggcgaat tcatccatgc ccccggcagc 

451 ggcaaaacca tcaaaaccga aaaactctcc acaccgtttt acgccaaaaa 

501 ctaccttgga gcgcatacgt tttttacaga atga 

This corresponds to the amino acid sequence <SEQ ID 716; ORF 206.ng>: 

g206 .pep 

1 MFSPDKTLFL CLGALLLASC GTTSGKHRQP KPKQTVRQIQ AVRISHIGRT 
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51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 
101 ARDMAAASRK IPDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS 
1S1 GKTIKTEKXiS TPFYAKNYLG AHTFFTE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 717>: 

m206 . seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAr AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

3 01 GCCCGCGACA TGGCGGCGGC AAGCCGsAAA ATCCCCGAcA GCCGCyTCAA 

3 51 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

401 ACGTCGGACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

451 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TTTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 718; ORF 206>: 

ra206 .pep . . 

1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 
51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 
101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 
151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from K gonorrhoeae: 

m206/g206 



10 20 30 40 50 60 

m206 - pep MFPPDKTLFLCLS ALLLAS CGTTSGKHRQPKPKQTVRQ I QAVRI SHI DRTQGSQELMLHS 

II I I I I I ( I I I = I t I ( I f I I t I 1 I 1 1 I I I I i 1 I I I I 1 t 1 I I 1 I I t 1 I M 1 I I I M I M 
g206 MFS PDKTLFLCLGALLLASCGTTSGKHRQPKPKQTVRQI QAVRI SHIGRTQGSQELMLHS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m206 . pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKKALNVKLPRTARDMAAASRKIPDSRXKAGD 

I M 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 i 1 1 M * 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 i I II 1 1 1 1 1 1 1 

g206 LGLIGTPYKWGGSSTATGFDCSGMIQLVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 

70 80 90 100 110 120 



130 140 150 160 170 

m2 0 6 . pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

:|i!IMMIIMMIIIIIIMIIIIhIIIIIMNIIIIIIIIMMMIIIM 

g206 IVFFNTGGAHRYSHVGLYIGNGEFIHAPGSGKTIKTEKLSTPFYAKNYLGAHTFFTE 

130 140 150 160 170 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 719>: 

a206 , seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

4 51 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 
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This corresponds to the amino acid sequence <SEQ ID 720; ORF 206.a>; 

a206.pep 

1 MFPPDK TLFL CLSALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 99.4% identity in 1 77 aa overlap 

10 20 30 40 50 60 

m206 . pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQ^VVRISHIDRTQGSQELMLHS 
I I I II I I I I I I I I I ( I I I I I I I I I I M I I I I I I I I I M I I M I I I M I I I I I I I M I I I I 
a206 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISH1DRTQGSQELMLHS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m206.pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 
I I M I I I I t I I M M I I I I I It I It 11 1 I 1 I I I I I I 1 I ! I II I I I M I I I M I I I I I I I 
a206 LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 

70 80 90 100 110 120 

130 140 150 160 170 

m206.pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
M M I I I I I I M I I I II I I I I II M I M 1 I I I 1 I M M M M I N II II I i II I I II I 
a206 LVFFNTGGAHRYSHVGLY I GNGE FIHAP S SGKT IKTEKLST PFYAKN YLGAHT FFTEX 

130 140 150 160 170 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 72 1>: 

g209.seq 

1 atgctgcggc atttaggaaa cgacttcgcc ttgggcgcgt tgtttttcga 

51 tgctgcggtt gatgtgccac tgctgggcga tggtcaggag gttgttgacc 

101 acccagtaga gaaccaaacc ggcagggaag aagaagaaca tgacggagaa 

151 aaccaacggc atgattttca tcattttcgc ctgcatcggg tcggtcggcg 

201 gcgggttcag ataggtttgg gcgaacatcg ttgccgccat aatgatgggc 

251 aggatgtagt aggggtcggc gcggctgagg tcggtaatcc agcccagcca 

301 aggtgcctgg cgcaattcta cggaggcgaa caatgcccag tacaagccga 

351 tgaagacggg gatttgcaac agcataggca gacagccgcc cagcgggttg 

401 atttcctcgt cttcgaaaag ctgcatcatc gcttgctgtt gcgccatacg 

451 gtcgtcgccg tatttttctt tgatggtctg cagttcgggt gcggcggcac 

501 gcattttcgc catcgaacgg taggaggcgt tggtcaatgg atacagtacg 

551 gctttgacga tgatggtcaa aacgacgatt gcccagcccc agttgccgat 

601 aatgttgtgc agttggttca ggagccagaa gagcggcgat gcgaaccagt 

651 gtactttacc gtagtctttt gccagttgca ggttgtcggc gatgtttgcg 

701 ataacggatg tggtttgcgg accggcatac aggttgaccg ccattttcgg 

751 ttttggcccc cgggttggga tagcggttaa 

This corresponds to the amino acid sequence <SEQ ID 722; ORF 209.ng>; 

g209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE WDHPVENQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDWGVG AAEVGNPAQP 

101 RCLAQFYGGE QCPVQADEDG DLQQHRQTAA QRVDFLVFEK LHHRLLLRHT 

151 VVAVFFFDGL QFGCGGTHFR HRTVGGVGQW IQYGFDDDGQ NDDCPAPVAD 

201 NWQLVQEPE ERRCEPVYFT WFCQLQWG DVCDNGCGLR TGIQVDRHFR 

251 FWPPGWDSG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 723>: 

m209 .seq 

1 ATGCTGCGGC ATTTAGGAAA CGACTTCGCC TTGGgGCGTT GTTTTTCGAT 

51 GCTGCGGTTG ATGTGCCATT GCTGGGCGAT GGTCAGGAGG TTGTTGACTA 

101 C CC AG T AC AA TACCAGACCG G C AGGGAAG A AGAAGAACAT GACGGAGAAA 

151 ACCAACGGCA TGATTTTCAT CATTTTCGCC TGCATCGGGT CGGTCGGCGG 

201 CGGGTTCAGA TAAGTTTGGG CGAACATCGT TGCCGCCATA ATGATGGGCA 

251 GGATGTAGTA GGGGTCGGCG CGGCTGAGGT CGGTAATCCA ACCCAGCCAA 
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301 GGTGCCTGGC GCAATTCTAC GGAGGCGAAC AATGCCCAAT ACAATCCGAT 

351 GAAGACGGGG ATTTGCAACA GCATAGGCAG GCAGCCGCCC AGCGGGTTGA 

4 01 TTTTCTCGTC TGTGTAAAGC TGCATCATCG CCTGTTGTTG CGCCATACGG 

451 TCGTCGCCGT ATTTCTCTTT GATGGCTTGC AGTTTGGGTG CGGCGGCACG 

501 CATTTTCGCC ATAGAGCGGT AAGAGGCGTT GGTCAATGGA TACAGTACGG 

551 CTTTGACGAT GATGGTTAAA ACGATAATCG CCCAGCCCCA GTTGCCGATG 

601 ATGTTGTGCA GTTGGTTCAG GAGCCAGAAG AGCGGGGhGG CGAACCAGTG 

651 TACTTTGCCG TAGTCTTTGG CCAGTTGCAG GTTGTCGGCG ATGTTTGCGA 

7 01 TGACGGATGT GGTCTGCGGG CCGGCGTAGA GGTTGATGGA GGCTTCGgTT 

751 TCGCGCCGTT TTGGATGGCG GCTAAAGGCA CGCTGACGCT GGTGCTGTAC 

801 AGCTTGTCGT TGCGGCGTTT GATGTCGATG TTGCACTCGC CTGCGGCGCA 

851 AACGCTTTGT CTGCCTTTAG GTTGGAGAAT CCAGGTGGAC ATGAAGTGGT 

901 GTTCAATCAT GCCGAGCCAG CCGGTCGGGG TTTTGCGGAT GTATTCGGCC 

951 TCGGATTTGC CGGATTTGGC ATCGTCGTCC AAGTCGGAAA AGCTGACTTT 

1001 TTGGAAGTTG CCTTCAGGGG TATAA 

This corresponds to the amino acid sequence <SEQ ID 724; ORF 209>: 

m209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE WDYPVQYQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ ISLGEHRCRH NDGQDWGVG AAEVGNPTQP 

101 RCLAQFYGGE QCPIQSDEDG DLQQHRQAAA QRVDFLVCVK LHHRLLLRHT 

151 WAVFLFDGL QFGCGGTHFR HRAVRGVGQW IQYGFDDDG* NDNRPAPVAD 

201 DWQLVQEPE ERGGEPVYFA VVFGQLQVVG DVCDDGCGLR AGVEVDGGFG 

251 FAPFWMAAKG TLTLVLYSLS LRRLMSMLHS PAAQTLCLPL GWRIQVDMKW 

301 CSIMPSQPVG VLRMYSASDL PDLASSSKSE KLTFWKLPSG V* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 209 shows 88.5% identity over a 253 aa overlap with a predicted ORF (ORF 209.ng) 
from N. gonorrhoeae: 

m209/g209 

10 20 30 40 50 60 

m209.pep MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDYPVQYQTGREEEEHDGENQRHDFHHFR 
I M M I M I i I 1 I I I I I 1 I I ! I I I I I M I I I I I : I 1 : M I 1 I I 1 I II II I I I I I I II I I 
g209 MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDHPVENQTGREEEEHDGENQRHDFHHFR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m209.pep LHRVGRRRVQISLGEHRCRHNDGQDVVGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 

I I I I I 1 I 11 I I : M 1 I I I I I II I M I I I I I M I II I I : II I 1 I I I I I I I I I ! I : I : I I I I 
g209 LHRVGRRRVQIGLGEHRCRHNDGQDWGVGAAEVGNPAQPRCLAQFYGGEQCPVQADEDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 0 9 . pep DLQQHRQAAAQRVDFLVCVKLHHRLLLRHT VVAVFLFDGLQFGCGGTHFRHRAVRGVGQW 

II I II M : I M I I I N I I I I I I I I I II I i I ! 1 h I I I I [ I I I I I ! I I I I I : I Mill 
g209 DLQQHRQTAAQRVDFLVFEKLHHRLLLRHTWAVFFFDGLQFGCGGTHFRHRTVGGVGQW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m20 9 . pep IQYGFDDDGXNDNRPAPVADDWQLVQEPEERGGEPVYFAWFGQLQWGDVCDDGCGLR 
It I M I I I I II: I I I I i I : 11 I II I I I I I I I I I I I : I I I M II I 1 I t I I : I I I I I 
g20 9 IQYGFDDDGQNDDCPAPVADNWQLVQEPEERRCEPVYFTVVFCQLQVVGDVCDNGCGLR 

190 200 210 220 230 240 

250 260 270 280 290 299 

ra209 . pep AGVEVDGGFG F-APFWMAAKGTLTLVLYSLSLRRLMSMLHSPAAQTLCLPLGWRIQVDMK 

: I : : I I II II 
g209 TGIQVDRHFRFWPPGWDSG 

250 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 725>: 

a209. seq 

1 ATGCTGCGGC ATTTAGGAAA CGACTTCGCC TTGGGCGCGT TGTTTTTCGA 
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51 TGCTGCGGTT GATGTGCCAT TGCTGGGCGA TGGTCAGGAG GTTGTTGATC 

101 ACCCAGTACA ATACCAGACC GGCAGGGAAG AAGAAGAACA TGACGGAGAA 

151 AACCAAAGGC ATGATTTTCA TCATTTTCGC CTGCATCGGG TCGGTCGGCG 

201 GCGGGTTCAG ATAGGTTTGG GCGAACATCG TTGCCGCCAT AATGATGGGC 

251 AGGATGTAGT AGGGGTCGGC GCGGCTGAGG TCGGTAATCC AACCCAGCCA 

301 AGGTGCCTGG CGCAATTCTA CGGAGGCGAA CAATGCCCAA TACAATCCGA 

351 TGAAGACGGG GATTTGCAAC AGCATAGGCA GGCAGCCGCC ChGCGGGTTG 

401 ATTTTCTCGT CTGTGTAAAG CTGCATCATG GCTTGTTGCT GCGCCATACG 

451 GTCGTCGCCG TATTTCTCTT TGATGGCTTG CAGTTTGGGC GCGGCGGCAC 

501 GCATTTTCGC CATCGAACGG TAAGAGGCGT TGGTCAATGG ATACAGTACG 

551 GCTTTGACGA TGATGGTTAA AACGATAATC GCCCAGCCCC AGTTGCCGAT 

601 GATGTTGTGC AGTTGGTTCA AAAGCCAAAA GAGGGGGGAG GCGAACCAGT 

651 GTACTTTGCC GTAGTCTTTG GCCAGTTGCA GGTTGTCGGC GATGTTTGCG 

701 ATAACGGATG TGGTCTGTGG GCCGGCGTAG AGGTTGATGG AGGCTTCGGT 

751 TTCGCACCGT TTTGGATAGC GGCTAAAGGC ACGCTGACGC TGGTGCTGTA 

801 CAGCTTGTCG TTGCGGCGTT TGATGTCGAT ACGGCAGTCG CCAGCGGCGC 

851 AAACGCTTTG TCCGCCTTTG GGTTGGAGGA TCCAGGTGGA CATGAAGTGG 

901 TGTTCAATCA TGCCGAGCCA GCCGGTCGGG GTTTTGCGGA TGTATTCGGC 

951 CTCGGATTTG CCGGATTTGG CATCGTCGTC CAAGTCGGAG AAGCTGACTT 

1001 TTTGGAAGTT GCCTTCAGGG GTATAA 

This corresponds to the amino acid sequence <SEQ ID 726; ORF 209.a>: 

a209.pep 

1 MLRHLGND FA LGALFFDAAV DVPLL GDGQE VVDHPVQYQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDWGVG AAEVGNPTQP 

101 RCLAQFYGGE QCPIQSDEDG DLQQHRQAAA QRVDFLVCVK LHHGLLLRHT 

151 VVAVFLFDGL QFGRGGTHFR HRTVRGVGQW IQYGFDDDG* NDNRPAPVAD 

201 DWQLVQKPK EGGGEPVYFA WFGQLQWG DVCDNGCGLW AGVEVDGGFG 

251 FAPFWIAAKG TLTLVLYSLS LRRLMSIRQS PAAQTLCPPL GWRIQVDMKW 

301 CSIMPSQPVG VLRMYSASDL PDLASSSKSE KLTFWKLPSG V* 

m209/a209 95.6% identity in 341 aa overlap 

10 20 30 40 50 60 

m20 9 . pep MLRHLGNDFALGALFFDAAVDVPLLGDGQEVVDYPVQYQTGREEEEHDGENQRHDFHHFR 
I I M I II I I M I 1 I I M I M I I I II M M I I i I : I M I I I I II M M I I I I I I M I I II I 
a209 MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDHPVQYQTGREEEEHDGENQRHDFHHFR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m20 9.pep LHRVGRRRVQISLGEHRCRHNDGQDVVGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 
I I I 1 II I I I I I : I M I M I I M I I I I i I I I M I I ! I I M II I I I I I II M I I I I I M M I 

a2 0 9 LHRVGRRRVQIGLGEHRCRHNDGQDVVGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m209 . pep DLQQHRQAAAQRVDFLVCVKLHHRLLLRHTVVAVFLFDGLQFGCGGTHFRHRAVRGVGQW 

I I I 1 II II I I I I I 1 I 1 I 1 I I 1 I I 1 I I I I I I I I I I I 1 I I M I I 11111111:1111111 
a2 0 9 DLQQHRQAAAQRVDFLVCVKLHHGLLLRHT VVAVFLFDGLQFGRGGTHFRHRTVRGVGQW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m209 . pep IQYGFDDDGXNDNRPAPVADDWQLVQEPEERGGEPVYFAWFGQLQWGDVCDDGCGLR 

II II I M M II II M I M M M M I I I : I : I I M I I I I I I M I I I I II I II II : I I I I 
a209 I QYGFDDDGXNDNRPAPVADDWQLVQKPKEGGGEPVYFAWFGQLQVVG DVCDNGCGLW 

190 200 210 220 230 240 

250 260 270 280 290 300 

m20 9.pep AGVEVDGGFGFAPFWMAAKGTLTLVLYSLSLRRLMSMLHSPAAQTLCLPLGWRIQVDMKW 
I I I I I I I I I I I I I I I : M I I I I I I I I I I I I I I I I I I : : I I I I M I I I 1 I I I I I I I I I I 
a209 AGVEVDGGFGFAPFWIAAKGTLTLVLYSLSLRRLMSIRQSPAAQTLCPPLGWRIQVDMKW 

250 260 270 280 290 300 



m209.pep 



310 320 330 340 

CSIMPSQPVGVLRMYSASDLPDLASSSKSEKLTFWKIiPSGVX 
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g211 . seq 




1 


atgttgcgga 


51 


ggtgggaaac 


101 


agtttttgga 


151 


gtaacggtag 


201 


agaagttacc 


251 


tgttcgcgca 


301 


ttcgacgaaa 


351 


cggcttccac 


401 


tcgcggctgc 


451 


aatcaggatg 


501 


gtttcaacgc 
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I ! I M I I! I I I !! I I I ! I I I I I I ! I 1 I ! I I I I I I I I I I I I ! I 
a209 CSIMPSQPVGVLRMYSASDLPDLASSSKSEKLTFWKLPSGVX 
310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 727>: 

ttgctgctgc caatcagttg ggcggtcgaa atggtgcggc 

ggggtcgata agtttgggcg tggtgctgat aatcaggttg 

aggaaacctg attgtagtcg gcgcgtccgg gcgtgccgct 

ccgtggcgca attcgagcgt gcgtttgttg tccttcagcg 

ttctttggcg aagatgatgt tgtcgccgcc gtttttgtcc 

ggaacaggtt tttcatgatg ccggattcgg tgtcaaaggt 

taaaccctgc cgttgcgctt gcccaagtta ttgaactcgc 

caaagacaat tcctgcttct gcttcaaaat ttcggcatat 

gcagctctgc ccacggtatc acccaaagct gcatgacggc 
gcaaacggca cggcaaactg catgacgggg cgtatccact 
caatccgcag gatag 

This corresponds to the amino acid sequence <SEQ ID 728; ORF 21 l.ng>: 

g211 .pep 

1 MLRIAAANQL GGRNGAAVGN GVDKFGRGAD NQVEFLEG NL IWGASGRAA 

51 VTVAVA QFER AFWLQRE VT FFGEDDWAA VFVLF AQEQV FHDAGFGVKG 

101 FDEINPAVAL AQV IELAGFH QRQFLLLLQN FGIFAAAQLC PRYHPKLHDG 

151 NQDGKRHGKL HDGAYPLFQR QSAG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 729>: 

m211 .seq 

1 ATGTTGCGGG TTGCTGCTGC CAATCAGTTG GGCGGTCGGA ATGGTACGGC 

51 GGTGGGAAAC GGGGTCGATG AGTTTGGGCG TGGTGCTGAT AATCAGGTTG 

101 AGTTTTTGGA AGGAAACCTG ATTGTAGTCG GCGCGTCCGG GCGTGCCGCT 

151 GTAACGGTAG CCGTGGCGCA ATTCGAGCGT GCGTTTGTTG TCGTTCAGCG 

201 AGAAGTTACC TTCTTTGGCG AAGATGATGT TGTCGCCGCC GTTTTTGTCC 

251 TGTTCGCGCA GGAACAGGTT TTTCATGATG CCGGATTCGG TATCGAAGGT 

3 01 TTCGACAAAA TAAACCCTGC CGTTGCGCTT GCCCAAACTG TTGAACTCGC 
351 CTGCCTCCAC CAAAGACAAT TCCTGCTTCT GCTTCAGGAT TTCAG CGTAT 

4 01 TCGCGGCTGC GTAGCTCTGC CCACGGTATC ACCCAAAGCT GCATGACGGC 
451 AACCAAAACG GCAAACGGCA CGGCAAACTG CAT C ACCGGG CGTAT C CATT 
501 GTTTCAATGC CAATCCGCAg GATAG 

This corresponds to the amino acid sequence <SEQ ID 730; ORF 21 1>: 

m211 .pep 

1 MLRVAAANQL GGRNGTAVGN GVDKFGRGAD NQVEFLEG NL IWGASGRAA 

51 VTVAVA QFER AFVWQRE VT FFGEDDWAA VFVLFA QEQV FHDAGFG I E G 

101 FDKINPAVAL AQ TVELACLH QRQFLLLLQD FSVFAAAXLC PRYHPKLHDG 

151 NQNGKRHGKL HHRAYPLFQC QSAG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from jV. gonorrhoeae 

ORF 21 1 shows 89.1% identity over a 174 aa overlap with a predicted ORF (ORF 21 Lng) 
from N. gonorrhoeae: 

m211/g211 

10 20 30 40 50 60 

m2 11 . pep MLRVAAANQLGGRNGTAVGNGVDE FGRGADNQVE FLEGNL I WGAS GRAAVTVAVAQFER 

I I I = I I I I I I I I I I I : I I I I I I I II I i N I II II I I I I II I i I I I I I I I I I II I I II I I 
g2 1 1 MLRI AAANQLGGRNGAAVGNGVDKFGRGADNQVEFLEGNLI WGASGRAAVTVAVAQFER 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 1 1 . pep AFVWQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGI EGFDKINPAVALAQTVELACLH 
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g 2 1 1 AFWLQREVTFFGEDDVVAAVFVLFAQEQVFHDAGFGVKGFDE INPAVALAQVI ELAGFH 

70 80 90 100 110 120 

130 140 150 160 170 

m2 1 1 . pep QRQFLLLLQDFSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQS AGX 

1 1 M li I M :|- i I M 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 M 1 1! MINI MM 

9211 QRQ FLLLLQNFG I FAAAQLC PRYHPKLHDGNQDGKRHGKLHDGAYPLFQRQS AG 

130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 73 1>: 

a211 . seq 

1 ATGTTGCGGG TTGCTGCTGC CAATCAGTTG GGCGGTCGGA ATGGTACGGC 

51 GGTGGGAAAC GGGGTCGATG AGTTTGGGCG TGGTGCTGAT AATCAGGTTG 

101 AGTTTTTGGA AGGAAACCTG ATTGTAGTCG GCGCGTCCGG GCGTGCCGCT 

151 GTAACGGTAG CCGTGGCGCA ATTCGAGCGT GCGTTTGTTG TCGTTCAGCG 

201 AGAAGTTACT TTCTTTGGCG AAGATGATGT TGTCGCCGCC GTTTTTGTCC 

251 TGTTCGCGCA GGAACAGGTT TTTCATGATG CCGGATTCGG TATCGAAGGT 

301 TTCGACAAAA TAAACCCTGC CGTTGCGCTT GCCCAAACTG TTGAACCCGC 

351 CTGCCTCCAC CAAAGACAAT TCCTGCTTCT GCTTCAGGAT TTCAGCGTAT 

4 01 TCGCGGCTGC GTAGCTCTGC CCACGGTATC ACCCAAAGCT GCATGACGGC 

4 51 AACCAAAACG GCAAACGGCA CGGCAAACTG CATCACCGGG CGTATCCATT 

501 GTTTCAATGC CAATCCGCAG GAT AG 

This corresponds to the amino acid sequence <SEQ ID 732; ORF 21 l.a>: 

a211.pep 

1 MLRVAAANQL GGRNGTAVGN GVDEFGRGAD NQVEFLEGN L IWGASGRAA 

51 VTVAVA QFER AFVWQRE VT FFGEDDWAA VFVLF AQEQV FHDAGFGIEG 

101 FDKINPAVAL AQTVE PACLH QRQ FLLLLQD FSVFAAA*LC PRYHPKLHDG 

151 NQNGKRHGKL HHRAYPLFQC QSAG* 

m21 l/a21 1 99.4% identity in 174 aa overlap 

10 20 30 40 50 60 

m2 1 1 . pep MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLI WGASGRAAVTVAVAQFER 

I M i I I M I I! I I I I M I I I I I M M I I I II I I I I II I It I I 11 II I I I I M II I I I I I I 
a2 1 1 MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLI WGASGRAAVTVAVAQFER 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 1 1 . pep AFWVQREVTFFGEDDVVAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVELACLH 
M I I I I I 1 I I I I M I I I I I M I I I I I I I 11 I I I I M M I I I I I I I M 11 I I I I I I I I 1 I 
a2 1 1 AFVWQREVT FFGEDDWAAVFVLFAQEQVFHDAG FG IEGFDKINPAVALAQTVE PACLH 

70 80 90 100 110 120 

130 140 150 160 170 

m2 1 1 . pep QRQFLLLLQDFS VFAAAXLCFRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 

1 I I I I I I I I I I II I M I I I I I I I I I I I I I II I 1 I 1 I I 1 I I I I I II I I I I i I I 1 I I 

a211 QRQFLLLLQDFSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 733>: 

g212.seq (partial) 

1 atggacaatc tcgtatggga cggcattccc gacatccgca cactcgacca 

51 aaccatccgc aaacacgcac acccgctcaa cctgattgtc tgcctccccg 

101 ataatcagat tcccgatttt caaaccgcac aagatgcttc ggactcggaa 

151 tgccgtctga agcaccgttt ggatcaggca acccagtgcc tccagttcga 

201 cagcatcaac ctcatcgaac acatcctgcc cgatgtccgc ttctggctgg 

251 ttcccccttc acgcacccgc cgcctgcacg aacacttcca ccacatttcc 

301 tggcagaccg aagccatccc gcaaaccgaa agcaagtccg acaaaccctg 

351 gtttgcactt ccacaaacat ccgaacggaa aaaaccggaa cacgtcctcg 
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4 01 tcatcggtgc aggcattgcc ggcgcatcga ccgcccacgc cttagcatca 

4 51 cacggcattt ccgttaccgt attggaagcc cgaaaagccg ctcaagccgc 

501 cagcggcaac cggcaagggc tgctttacgc caaaatctcg ccgcacgaca 

551 ccggacagac cgaactgctg cttgccggct acggctacac caaacgcctg 

601 ctcggacaca tcctgcccga ctccgacact tggggcggca acggcatcat 

651 ccacctcaat tacagccgca ccgaacaaca acgcaatcac gaattgggtt 

701 tgcaaaaaca ccataaccac ctctaccgca gcatcacgtc tgcagaagcc 

751 gaaaaaatcg ccggcatccc gctgaacacg ccctacgccg aaccattatg 

801 cggactctac tggcaacacg gcgtatggct caatccgccc gcattcgtcc 

851 gcaccctcct cagccatccg ctgatcgaac tatatgaaaa cacaacgtta 

901 accggcattt cccacgacgg agaaaagtgg attgcaagca cgccaaacgg 

951 cacatttacc gccacacaca tcatctactg caccggcgcg cacagcccct 

1001 gcctgcccga aaccaacctc gccgccctac ccctcaggca aatacgcgga 

1051 caaaccggcc tcacaccgtc caccccgttt tccgaacaac tgcgttgcgc 

1101 cgtttcaggc gaaagctaca tcagcccgtc gtggcacgga ctgcactgct 

1151 acggcgcgag ttttattccc aacagcagca ataccggatg gaacgaagcc 

1201 gaagaagcct caaaccgcca agcattggca caccttaacc ccgcccttgc 

1251 cgaatcattg ttt... 

This corresponds to the amino acid sequence <SEQ ID 734; ORF 212.ng>: 

g2l2.pep (partial) 

1 MDNLVWDGIP DIRTLDQTIR KHAHPLNLIV CLPDNQIPDF QTAQDASDSE 

51 CRLKHRLDQA TQCLQFDSIN LIEHILPDVR FWLVPPSRTR RLHEHFHHIS 

101 WQTEAIPQTE SKSDKPWFAL PQTSERKKPE HVLVIGAGIA GASTAHAIA S 

151 HGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTGQTELL LAGYGYTKRL 

201 LGHILPDSDT WGGNGIIHLN YSRTEQQRNH ELGLQKHHNH LYRSITSAEA 

251 EKIAGIPLNT PYAEPLCGLY WQHGVWLNPP AFVRTLLSHP LIELYENTTL 

3 01 TGISHDGEKW IASTPNGTFT ATH I I YCTG A HSPCLPETNL AALPLRQIRG 
351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSNTGWNEA 

4 01 EEASNRQALA HLNPALAESL F. . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 73 5>: 

m2l2.seq 

1 ATGGACAATC TCGTATGGGA CGGCATTCCC GACATCCGCA CACTCGACCA 

51 AGCCATCCGC AAACACGCAC CCCCGCTCAA CCTGATTATC TGCCTCCCCG 

101 ATAATCAGAT TCCCGATTTT CAAACCGCAC AAGATGCTTC GGACGCGGAA 

151 TGCCGTCTGA AGCACCGTTT GGATCAGGCA ATGCAGTGCC TCCAGTTCGA 

201 CAGCATCAAC CTCATCGAAC ACATCCTGCC CGATGTCCGC TTCTGGCTGG 

251 TTCCCCCTTC ACGCACCCAC CACCTGCACG AACATTTCCA CCACATTTCC 

301 TGGCAGACCG AAGCCATCCC GCAAACCGAA AGCAAGCCCG ACAAACCCTG 

3 51 GTTTGCACTT CCACAAACAT CCGAACGGCA AAAACCGGAA CACATCCTCG 
401 TTATCGGCGC GGGCATATCC GGCGCGGCAA CCGCCCACGC CTTAGCATCA 

4 51 CACGGCATTT CCGTTACCGT ATTGGAAGCC CGAAAAGCCG CCCAAGCCGC 
501 CAGCGGCAAC CGCCAAGGGC TGCTCTACGC CAAAATCTCG CCGCACGACA 
551 CCGAACAGAC CGAACTTTTG CTTGCCGGCT ACGGCTACAC CAAACGCCTG 
601 CTCGGACACA TCCTGCCCGA ATCCGAAACC TGGGGCGGCA ACGGCATCAT 
651 CCACCTCAAT TACAGCCGCA CCGAACAACA ACGCAATCAC GAATTGGGTT 
701 TGCAAAAACA CCATAACCAC CTCTACCGCA GCATCACATC TGCAGAAGCC 
751 GAAAAAATCG CCGGTATCCC ACTGTCCGTC CCATACGACC ACCCTTCATG 
801 CGGACTCTAC TGGCAACACG GCGTATGGCT CAATCCACCC GCATTCGTCC 
851 GCACCCTCCT CAACCATCCG CTCATTGGAC TACACGAAGA CACACCCTTG 
901 ACCGACATTT CCCACGACGG GGaAAAGTGG ATTGCAAGCA CGCCAAACGG 
951 CACATTTACC GCCACACACA TCATCTACTG CACCGGTGCG AACAGCCCCT 

1001 ACCTACCCGA AACCAACCTC GCCGCCCTGC CTCTCAGGCA AATACGCGGA 

1051 CAAACCGGCC TCACACCGTC CACCCCGTTT TCCGAACAAC TGCGTTGCGC 

1101 CGTTTCAGGC GAAAGCTACA TCAGCCCGTC GTGGCACGGA CTGCACTGCT 

1151 ACGGCGCGAG TTTTATTCCC AACAGCAGCC ATACCGGATG GAACGAAGCC 

1201 GAAGAAGCCT CAAACCGCCA AGCATTGGCA CACCTTAACC CCGCCCTTTC 

12 51 CGAATCATTG TTTGCCGCCA ACCCAAACCC CCAAAAACAC CAAGGGCACG 

13 01 CCGCCATACG CTGCGACAGC CCCGACCACC TTCCCCTAGT CGGCGCACTC 

13 51 GGCGACATTG CCGCCATGCG GCAGACCTAC ACCAAACTCG CGCTGGACAA 

14 01 AAACTACCGC ATCGACACCC CATGCCCATA CCTGCCTAAT GCCTACGTCA 
14 51 ACACCGCGCA CGGCACCCGC GGACTCGCCA CCGCCCCCAT CTGCGCCGCC 
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1501 GmCAwTG C AG CCCAAATCsT AGGCyTGCCC CATCCCTTTT yAcAAcGCCT 
1551 gCGCCACGCC cTAcACCCCA ACCGCACCAT CATCCGCGCC ATCGTCAGAA 
1601 GGAAGGATCT AACCCCTTAA 

This corresponds to the amino acid sequence <SEQ ID 736; ORF 212>: 

m212 -pep 

1 MDNLVWDGI P DIRTLDQAIR KHAPPLNLII CLPDNQIPDF QTAQDASDAE 

51 CRLKHRLDQA MQCLQFDSIN LIEHILPDVR FWLVPPSRTH HLHEHFHHIS 

101 WQTEAIPQTE SKPDKPWFAL PQTSERQKPE HILVIGAGIS GAATAHALA S 

151 HGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTEQTELL LAGYGYTKRL 

201 LGHILPESET WGGNGIIHLN YSRTEQQRNH ELGLQKHHNH LYRSITSAEA 

251 EKIAGIPLSV PYDHPSCGLY WQHGVWLNPP AFVRTLIiNHP LIGLHEDTPL 

301 TDISHDGEKW IASTPNGTFT ATHIIYCTGA NSPYLPETNL AALPLRQIRG 

351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSHTGWNEA 

4 01 EEASNRQALA HLNPALSESL FAANPNPQKH QGHAAIRCDS PDHLPLVGAL 

451 GDIAAMRQTY TKLALDKNYR IDTPCPYLPN AYVNTAHGTR GLATAPICAA 

501 XXAAQIXGLP HPFXQRLRHA LHPNRTI I RA IVRRKDLTP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 212 shows 92.9% identity over a 421 aa overlap with a predicted ORF (ORF 212.ng) 
from TV. gonorrhoeae: 

m212/g212 

10 20 30 40 50 60 

m2 1 2 . pep MDNLVWDGI PD I RTLDQAI RKHAPPLNLI I CLPDNQI PDFQTAQDASDAE CRLKHRLDQA 

1 1 1 1 1 1 1 1 1 1 i II 1 1 1 h 1 1 1 II 1 1 M h 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 - 1 1 II M 1 1 1 1 1 

g2 1 2 MDNLVWDGI PDIRTLDQTIRKHAHPLNLI VCLPDNQI PDFQTAQDASDSECRLKHRLDQA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 12 . pep MQCLQFDS INL I EHI LPDVRFWLVPPSRTHHLHEHFHHI SWQTEAI PQTESKPDKPWFAL 

I II I I I i I I I I I I I I I I I I I II I I I I I I : : II I I I I I I I I I I I I I I I II I I 11)1111 
g212 TQCLQFDSINLIEHILPDVRFWLVPPSRTRRLHEHFHHI SWQTEAI PQTESKSDKPWFAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m212 . pep PQTSERQKPEHILVIGAGI SGAATAHALASHGI SVTVLEARKAAQAASGNRQGLLYAKIS 

I M 1 1 h M 1 h M M 1 1 1 : 1 h I II I M M M M M 1 1 1 1 M M M II II M 1 1 II i II 

g2 1 2 PQTSERKKPEHVLVIGAGI AGASTAHALASHGI SVTVLEARKAAQAASGNRQGLLYAKI S 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2 1 2 . pep PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGI IHLNYSRTEQQRNHELGLQKHHNH 

I I 1 1 I I I! I i I IN I i II I! hhilll !l i II IMI!!! MINI I I I IN! I I 

g212 PHDTGQTELLLAGYGYTKRLLGHILPDSDTWGGNGI I HLNYSRTEQQRNHELGLQKHHNH 

190 200 210 220 230 240 

250 260 270 280 290 300 

m2 12 . pep LYRS I TS AE AE KI AG I PLSVPYDHPSCGLYWQHGVWLNPPAFVRTLLNHPLIGLHEDTPL 

M M M I Mil III MM-l! n IIIIIIMMMIIIIIIMhMM hhl I 

g212 LYRSITSAEAEKIAGIPLNTPYAEPLCGLYWQHGVWLNPPAFVRTLLSHPLIELYENTTL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m212 . pep TDISHDGEKWIASTPNGTFTATHIIYCTGANSPYLPETNLAALPLRQIRGQTGLTPSTPF 

i n n i n 1 1 1 1 1 inin 1 1 ii inn mi miimmmimiimiiinmu 

g212 TGISHDGEKWIASTPNGTFTATHIIYCTGAHSPCLPETNLAALPLRQIRGQTGLTPSTPF 

310 320 330 340 350 360 



m212 -pep 



370 380 390 400 410 420 

SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 
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g212 SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSNTGWNEAEEASNRQALAHLNPALAESL 
370 380 390 400 410 420 

430 440 450 460 470 480 

m2 12 . pep FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMRQTyTKLALDKNYRIDTPCPYLPN 
I 

g212 F 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 737>: 

a212 . seq 

1 ATGGACAATC TCGCATGGAA CGGCATTCCC GACATCCGCA CACTCGACCA 

51 AACCATCCGC AAACACGCAC ACCCGCTCAA CCTGATTGTC TGCCTCCCCG 

101 ATAATCAGAT TCCCAATTTT CAAACCGCAC AAGATGCTTC GGACGCGGAA 

151 TGCCGTCTGA AGCACCGTTT GGATCAGGCA ACCCAGTGCC TCCAGTTCGA 

201 CAGCATCAAC CTGATTGAAC ACATCCTGCC CGATGTCCGC TTCTGGCTGG 

251 TTCCCCCTTC ACGCACCCGC CGCCTGCACG AACACTTCCA CCACATTTCC 

301 TGGCAGACCG AAGCCATCCC GCAAACCGAA AGTAAGCCCG ACAAACCCTG 

351 GTTTGCACTT CCACAAACAT CCGAACGGCA AAAACCGGAA CACATCCTCG 

4 01 TTATCGGAGC GGGCATATCC GGCGCGGCAA CCGCCCACGC CT TAG CATC A 

4 51 TACGGCATTT CCGTTACCGT ATTGGAAGCC CGAAAAGCCG CCCAAGCCGC 

501 CAGCGGCAAC CGCCAAGGGC TGCTCTACGC CAAAATCTCG CCGCACGACA 

551 CCGAACAAAC CGAACTGCTG CTTGCCGGCT ACGGCTACAC CAAACGCCTG 

601 CTCGGACATA TCCTGCCCGA ATCCGAAACC TGGGGCGGCA ACGGCATCAT 

651 CCACCTCAAT TACAGCCGCA CCGAACAACA ACGCAATCAC GAATTGGGTT 

701 TGCAAAAACA CCATAACCAC CTCTACCGCA GCATCACGCA GGCAGAAGCC 

751 GAAAAAATCG CCGGCATCCC TCTGAACACG CCCTACGCCG AACCATTATG 

801 CGGACTGTTT TGGCAGTACG GCGTATGGCT CAATCCTCCC ACATTCGTCC 

8 51 GCGCCCTCCT CAGCCATCCG CTCATTGGAC TACACGAAGA CACACCGTTA 

901 ACCGACATTT CCCACGACGG GGAAAAGTGG ATTGCAAGCA CGCCAAACGG 

951 CACATTTACC GCCACACACA TCATCTACTG CACCGGTGCG AACAGCCCCT 

1001 ACCTACCCGA AACCAACCTC GCCACCCTGC CCCTCAGGCA AATACGCGGA 

1051 CAAACCGGCC TCACACCGTC CACCCCGTTT TCCGAACAAC TGCGTTGCGC 

1101 CGTTTCAGGC GAAAGCTACA TCAGCCCGTC GTGGCACGGA CTGCACTGCT 

1151 ACGGCGCGAG TTTTATTCCC AACAGCAGCC ATACCGGATG GAACGAAGCC 

1201 GAAGAAGCCT CAAACCGCCA AGCATTGGCA CACCTTAACC CCGCCCTTTC 

1251 CGAATCATTG TTTGCCGCCA ACCCAAACCC CCAAAAACAC CAAGGGCACG 

1301 CCGCCATACG CTGCGACAGC CCCGACCACC TTCCCCTAGT CGGCGCACTC 

1351 GGCGACATTG CCGCTATGCA ACAAACTTAC GCCAAACTCG CGCTGGACAA 

14 01 AAACTATCGC ATCGATGCCC CCTGCCCGTA CCTGCCCAAT GCCTACGCCA 

14 51 ACACCGCCCA CGGCACACGC GGGCTTGCCA CCGCCCCCAT CTGCGCCGCC 

1501 GCCGTTGCAG CCGAAATCCT AGGCTTGCCC CATCCCCTCT CAAAACGCCT 

1551 GCGCCACGCC CTACACCCCA ACCGCGCCAT CATCCGCGCC ATCGTCAGAA 

1601 GGAAGGATCT AACCCCTTAA 

This corresponds to the amino acid sequence <SEQ ID 738; ORF 212. a>: 

a212 .pep 

1 MDNLAWNGIP DIRTLDQTIR KHAHPLNLIV CLPDNQIPNF QTAQDASDAE 

51 CRLKHRLDQA TQCLQFDSIN LIEHILPDVR FWLVPPSRTR RLHEHFHHIS 

101 WQTEAIPQTE SKPDKPWFAL PQTSERQKPE H ILVIGAGIS GAATAHAL AS 

151 YGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTEQTELL LAGYGYTKRL 

201 LGHILPESET WGGNGIIHLN YSRTEQQRNH ELGLQKHHNH LYRSITQAEA 

251 EKIAGIPLNT PYAEPLCGLF WQYGVWLNPP TFVRALLSHP LIGLHEDTPL 

301 TDISHDGEKW IASTPNGTFT ATHIIYCTGA NSPYLPETNL ATLPLRQIRG 

351 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSHTGWNEA 

4 01 EEASNRQALA HLNPALSESL FAANPNPQKH QGHAAIRCDS PDHLPLVGAL 

4 51 GDIAAMQQTY AKLALDKNYR IDAPCPYLPN AYANTAHGTR G LATAPICAA 

501 AVAAEILGLP HPLSKRLRHA LHPNRAIIRA IVRRKDLTP* 

m212/a212 93.7% identity in 539 aa overlap 

10 20 30 40 50 60 

m212.pep MDNLVWDGIPDIRTLDQAIRKHAPPLNLIICLPDNQIPDFQTAQDASDAECRLKHRLDQA 
I I I I : I : I I I I I M I I I : I 1 I I I 1 I I I I : I i I I I I I t : I I I I I I I I II I I I I i I I I I I I 

a212 M DN L AWN G I PD I RTL DQT I RKHAH P LN LI VC L P DN Q I PN FQT AQD AS D AEC RLKHRL DQ A 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m212.pep MQCLQFDSINLIEHILPDVRFWLVPPSRTHHLHEHFHHISWQTEAIPQTESKPDKPWFAL 
I I I I i I I I I I I I 1 I I I I I I I t I I I II I I : : I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I 
a2 1 2 TQCLQFDS I NLIEHI LPDVRFWLVPPSRTRRLHEHFHHI SWQTEAI PQTES KPDKPWFAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m212 .pep PQTSERQKPEHILVIGAGI SGAATAHALASHGI SVTVLEARKAAQAASGNRQGLLYAKI S 

I I I I I I I I i f i I I If I M I t M i ! I I i t I I : I t I I I I I i I I t I I I I t i I I I I It I I II I t 
a2 1 2 PQTSERQKPEHI LVI GAG I S GAAT AHALAS YGI SVTVLEARKAAQAASGNRQGLLYAKI S 

130 140 150 160 170 180 

190 200 210 220 230 240 

m2 12 . pep PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 
I I I I I I II I I I I I I I I I I I I M I I.I I I II I I I I I i I M I I I I I II I I I I I I I II II I I I I 
a212 PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 

190 200 210 220 230 240 

250 260 270 280 290 300 

m212 . pep LYRSITSAEAEKIAGIPLSVPYDHPSCGLYWQHGVWLNPPAFVRTLLNHPLIGLHEDTPL 
I I I I I I : II I II I I I I I I : : I I : I I I I : I I : II II I I I : I I I : I I : I II I I I I I I I I I 
a212 LYRSITQAEAEKIAGIPLNTPYAEPLCGLFWQYGVWLNPPTFVRALLSHPLIGLHEDTPL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m2 12 . pep TDISHDGEKWIASTPNGTFTATHII YCTGANSPYLPETNLAALFLRQIRGQTGLTPSTPF 

I I I I I I I I I I I I I I 1 I I I II I I I I I I I I I I I 1 I 1 M I 1 I I I : I I I I I I I 1 I I t I I I 1 II I 
a212 TDISHDGEKWIASTPNGTFTATHIIYCTGANSPYLPETNLATLPLRQIRGQTGLTPSTPF 

310 320 330 340 350 360 

370 380 390 400 410 420 

m212 . pep S EQL RC A VS GE S Y I S PS WHGLHC YGA S F I PN S S H TG WNE AE E A S NRQAL AHLN PAL S ES L 

I I I II I I I I I I II I I I I I I I I I I I I I I I 11 II I I I I I I I I I I I i I I I 1 I I I I I I I I I I I 1 
a212 SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 

370 380 390 400 410 420 

430 440 450 460 470 480 

m2 12 . pep FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMRQTYTKLALDKNYRIDTPCPYLPN 
I I I I I I 1 I 1 II I I I I 1 I I I I I It I I I I 1 I I I i I II I : I I I : I I I I I I I I I I I : I I I I I I I 
a212 FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMQQTYAKLALDKNYRIDAPCPYLPN 

430 440 450 460 470 480 

490 500 510 520 530 540 

m212 .pep AYVNTAHGTRGLATAPICAAXXAAQIXGLPHPFXQRLRHALHPNRTIIRAIVRRKDLTPX 
I 1 : M I I 1 I I I I I I 1 I I I I I I I : I Mill: : I I I 1 I 1 I I I I : 1 I ! I I I I I I I I I 11 
a212 AYANTAHGTRGLATAPICAAAVAAEILGLPHPLSKRLRHALHPNRAIIRAIVRRKDLTPX 

490 500 510 520 530 540 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 739>: 

g2l4 . seq 

1 atgatacaaa agatatgtaa gctatttgtt ttaattgtaa tttttgcaac 

51 ttctcccgct tttgcccttc aaagcgacag cagacggccc atccaaatcg 

101 aagccgacca aggttcgctc gatcaagcca accaaaggac cacatttagc 

151 ggcaatgtca tcatcagaca gggtacgctc aacatttccg cctcgtgtgt 

2 01 caacgtcaca cgcggcaggc aaaggcggcg aatccgtgag ggcggaaggt 

2 51 tcgcccgtcc gcttcagcca aacgttggac gggggcaaag ggacggtgcg 

3 01 cggtcaggca aacaacgtta cctattcctc cgcaggaagc actgtcgttc 

3 51 tgaccggcaa tgccaaagtg cagcgcggcg gcgacgttgc cgaaggtgcg 

4 01 gtcattacct acaacaccaa aaccgaagtc tataccatca acggcagcac 
4 51 gaaatcgggt gcgaaatccg cttccaaaac cggcagggtc agcgtcgtca 
501 tccagccttc aagcacacaa aaaaccgaat aaccccgatg ccgtctgaaa 
551 cggaaacgca gttcagacgg catttgccga ccgaaatgcc gagaagagat 
601 tattga 

This corresponds to the amino acid sequence <SEQ ID 740; ORF 214.ng>: 
g2l4 .pep 
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m214 . seq 


(partial) 


1 


ATGATACAAA 


51 


GTCCCCCGCT 


101 


AGGCCGACCA 


151 


GGAAACGTCG 


201 


CAATGTTACA 


251 


CGCCAGTCCG 


301 


GGACAGGCAA 


351 


AACCGGTAAT 


401 


TGATTACATA 


451 


AAATT. . . 
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1 MIQKICKLFV LIVIFATSPA FA LQSDSRRP IQIEADQGSL DQANQRTTFS 
51 GNVIIRQGTL NISASCVNVT RGRQRRRIRE GGRFARPLQP NVGRGQRDGA 
101 RSGKQRYLFL RRKHCRSDRQ CQSAARRRRC RRCGHYLQHQ NRSLYHQRQH 
151 EIGCEIRFQN RQGQRRHPAF KHTKNRITPM PSETETQFRR HLPTEMPRRD 
201 Y 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 741>: 

AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 
TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 
AGGTTCGCTC GATCAAG CCA ACCAAAGCAC CACATTCAGC 
TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 
CGCGGCGGCA AAGGCGGCGA AT CCGTGAGG GCGGAAGGTT 
CTTCAGCCAG ACATTGGACG GCGGCAAAGG CACGGTGCGC 
ACAACGTTGC TTATTCATCT GCAGGCAGCA CCGTAGTCTT 
GCCAAAGTAC AGCGCGGCGG CGATGTCGCC GAAGGTG CGG 



This corresponds to the amino acid sequence <SEQ ID 742; ORF 214>: 

m214.pep (partial) 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
51 GNWIRQGTL NISAARVNVT RGRQRRRIRE GGRFASPLQP DIGRRQRHGA 
101 RTGKQRCLFI CRQHRSLNR* CQSTARRRCR RRCGDYIQHQ NRSLYHQRQH 
151 KI. . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 214 shows 80.3% identity over a 152 aa overlap with a predicted ORF (ORF 214.ng) 
from N. gonorrhoeae: 

m214/g214 

10 20 30 40 50 60 

m214 .pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 

MMMIIMM-hMIIIMMMhlMMIilillillll I I I M I I = i I I I i I 
g214 MIQKICKLFVLIVIFATSPAFALQSDSRRPIQIEADQGSLDQANQRTTFSGNVIIRQGTL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m214.pep NI SAARVNVTRGRQRRRIREGGRFASPLQPD IGRRQRHGARTGKQRCLFI CRQHRSLNRX 

MM: i! I II M ill lill II Ml MlhMI II 111 = 1111 Ih hi I 
g214 NISASCVNVTRGRQRRRIREGGRFARPLQPNVGRGQRDGARSGKQRYLFLRRKHCRSDRQ 

70 80 90 100 110 120 

130 140 150 

m2 14 . pep CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKI 

11 1 - 1 1 1 1 MM I ■• M I M ( 1 1 1 M I ! ■• ! 

g214 CQS AARRRRCRRCGHYLQHQNRS LYHQRQHE I GCE I RFQNRQGQRRHPAFKHTKNR I T PM 

130 140 150 160 170 180 

g2 14 PSETETQFRRHLPTEMPRRDY 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 743>: 

a214 . seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 

201 CAATGTTACA CGCGGC.GGC AAAGGCGGCG AATCCGTGAG GGCGGAAGGT 

251 TCGCCAGTCC GCTTCAGCCA GACATTGGAC GGCGGCAAAG GCACGGTGCG 

301 CGGACAGGCA AACAACGTTG CTTATTCATC TGCAGGCAGC ACCGTAGTCT 
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351 TAACCGGTAA TGCCAAAGTA CAGCGCGGCG GCGATGTCGC CGAAGGTGCG 

401 GTGATTACAT ACAACACCAA AACCGAAGTC TAT AC CATC A GCGGCAGCAC 

451 AAAATCCGGC GCAAAATCCG CTTCCAAATC CGGCAGGGTC AGCGTCGTTA 

501 TCCAGCCTTC GAGTACGCAA AAATCCGAAT AATCCCAATG CCGTCTGAAA 

551 CATAAACCTG GTTCGGACGG CATTTGCCGA CCGAAATATT GAAGAGATAT 

601 TTATGA 

This corresponds to the amino acid sequence <SEQ ID 744; ORF 214.a>: 

a214 .pep 

1 MIQKICKLFV LIAFFSAS PA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 

51 GNWIRQGTL NISAARVNVT RGXQRRRIRE GGRFASPLQP DIGRRQRHGA 

101 RTGKQRCLFI CRQHRSLNR* CQSTARRRCR RRCGDYIQHQ NRSLYHQRQH 

151 KIRRKIRFQI RQGQRRYPAF EYAKIRIIPM PSET*TWFGR HLPTEILKRY 

201 L* 

m214/a214 99.3% identity in 152 aa overlap 

10 20 30 -40 50 60 

m214.pep MIQKICKLFVLIAFFSAS PAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 

I I I I J ! ! f I ) ! I I II I 1 I M I I I I I I I I ! I I I I ! 1 I I II I I I I I I I I I I I I I I I I I I I I I 
a214 MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m214 .pep NISAARVNVTRGRQRRRIREGGRFASPLQPDIGRRQRHGARTGKQRCLFICRQHRSLNRX 
I I II I I I I 11 I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I M I I It I I I I I I 
a214 NISAARVNVTRGXQRRRIREGGRFASPLQPDIGRRQRHGARTGKQRCLFICRQHRSLNRX 

70 80 90 100 110 120 

130 140 150 

m214 . pep CQ ST ARRRCRRRCG D Y I QHQN RS L YHQRQHK I 

1 I I I I I I I I I I I I I II I ! 1 I I I I I N I I M I I 
a214 CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKIRRKIRFQIRQGQRRYPAFEYAKIRIIPM 

130 140 150 160 170 180 

a214 PSETXTWFGRHLPTEILKRYLX 
190 200 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 745>: 

g214 -1 . seq 

1 AT GAT AC AAA AGATATGTAA GCTATTTGTT TTAATTGTAA TTTTTGCAAC 

51 TTCTCCCGCT TTTGCCCTTC AAAGCGACAG CAGACGGCCC ATCCAAATCG 

101 AAGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGTAC CACATTTAGC 

151 GGCAATGTCA TCATCAGACA GGGTACGCTC AACATTTCCG CCTCGCGCGT 

201 CAACGTCACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCCGTCCG CTTCAGCCAA ACGTTGGACG GGGGCAAAGG GACGGTGCGC 

301 GGTCAGGCAA ACAACGTTAC CTATTCCTCC GCAGGAAGCA CCGTCGTTCT 

351 GACCGGCAAT GCCAAAGTGC AGCGCGGCGG CGACGTTGCC GAAGGTGCGG 

401 TCATTACCTA CAACACCAAA ACCGAAGTCT ATACCATCAA CGGCAGCACG 

451 AAATCGGGTG CGAAATCCGC TTCCAAAACC GGCAGGGTCA GCGTCGTCAT 

501 CCAGCCTTCA AGCACACAAA AAACCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 746; ORF 214-1. ng>: 

g214-l.pep 

1 MIQKICKLFV LIVIFATSPA FA LQSDSRRP IQIEADQGSL DQANQSTTFS 
51 GNVIIRQGTL NISASRVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 
101 GQANNVTYSS AGSTVVLTGN AKVQRGGDVA EGAVTTYNTK TEVYTINGST 
151 KSGAKSASKT GRVSWIQPS STQKTE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 747>: 

m214-l.seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 
51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 
101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 
151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 
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201 CAATGTTACA CGCGGCGGCA 

251 CGCCAGTCCG CTTCAGCCAG 

301 GGACAGGCAA ACAACGTTGC 

351 AACCGGTAAT GCCAAAGTAC 

401 TGATTACATA CAACACCAAA 

451 AAATCCGGCG CAAAATCCGC 

501 CCAGCCTTCG AGTACGCAAA 



AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 
ACATTGGACG GCGGCAAAGG CACGGTGCGC 
TTATTCATCT GCAGGCAGCA CCGTAGTCTT 
AGCGCGGCGG CGATGTCGCC GAAGGTGCGG 
ACCGAAGTCT ATACCATCAG CGGCAGCACA 
TTCCAAATCC GGCAGGGTCA GCGTCGTTAT 
AATCCGAATA A 



This corresponds to the amino acid sequence <SEQ ID 748; ORF 214-1>: 

m214-l .pep 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
51 GNVVIRQGTL NISAARVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 
101 GQANNVAYSS AGSTVVLTGN AKVQRGGDVA EGAVITYNTK TEVYTISGST 
151 KSGAKSASKS GRVSVVIQPS STQKSE* 



m214-l/g214-l 93.8% identity in 176 aa overlap 



10 20 30 40 50 60 

m2 14-1. pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 
I 1 I I I I t i II I I : : I : : I ! I I I I I 1 I 1 I : I I I I t I I M i I I I I I I f I 1 1 i 1 I I : I I I I I I 
g2 1 4 - 1 MI QKI CKLFVLI VI FATSPAFALQSDSRRPI QIEADQGSLDQANQSTTFSGNVI I RQGTL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m214-l . pep NISAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSSAGSTVVLTGN 
I I I I : I I I I I I I 1 1 I I I I I I I I I I I I I I I I (I I I I I I I I I I I I I I I : I I E I I I I I I I I I 1 
g214-l NIS AS RVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVTYSS AGSTVVLTGN 

70 80 90 100 110 120 



130 140 150 160 170 

m2 1 4 - 1 . pep AKVQRGGDVAEGAVITYNTKTEVYTI SGSTKSGAKSASKSGRVS VVIQPSSTQKSEX 
I I I E I I I I I I I I I I I I I I I I I I I I It : I I I I 1 I I I I I I I : I I I I I I I I I I I I I I : I I 
g2l4-l AKVQRGGDVAEGAVITYNTKTEVYTINGSTKSGAKSASKTGRVSVVIQPSSTQKTEX 

130 140 150 160 170 



g214-l/p38685 

sp| P38685 1 YHBN_ECOLI 17.3 KD PROTEIN IN MURA-RPON INTERGENIC REGION PRECURSOR (ORF185) 
>gi 1 551336 (U12684) orfl85 [Escherichia coli] >gi 1 606139 (018997) ORF_ol85 [Escherichia coli] 
>gi 1 1789592 (AE000399) orf, hypothetical protein [Escherichia coli] Length = 185 
Score =97.1 bits (238), Expect = 6e-20 

Identities = 57/126 (45%), Positives = 74/126 (58%), Gaps = 3/126 (2%) 



Query; 19 PAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTLNISAARVNVTR — GGKGG 7 6 

FAFA+ D+ QPI IE+DQ SLD TF+GNV++ QGT+ I+A +V VTR G +G 

Sbjct: 24 PA F A VT GDT DQPIHIES DQQSLDMQGN V VT FTGN V I VTQGT I K I N ADK W VT RPGG EQGK 83 

Query: 77 E S VRAEG S P VRF S QT LDGGKGT VRGQ AN N V A Y S S AG S T V VLTGN AK VQRG G D V AEG A V I T 136 

£ + GP F Q DGK VGA+ + Y A VVLTGNA +Q+ +G IT 

Sbjct: 84 EVIDGYGKPATFYQMQDNGK-PVEGHASQMHYELAKDFVVLTGNAYLQQVDSNIKGDKIT 142 



Query: 137 YNTKTE 142 

Y K + 
Sbjct: 143 YLVKEQ 148 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 749>: 



a2l4-l .seq 

1 ATGATACAAA 



51 
101 
151 
201 
251 
301 
351 
401 
451 
501 



GTCCCCCGCT 
AGGCCGACCA 
GGAAACGTCG 
CAATGTTACA 
CGCCAGTCCG 
GGACAGGCAA 
AACCGGTAAT 
TGATTACATA 
AAATCCGGCG 
CCAGCCTTCG 



AGATATGTAA 
TTTGCCCTTC 
AGGTTCGCTC 
T CATC AG AC A 
CGCGGCGGCA 
CTTCAGCCAG 
ACAACGTTGC 
GCCAAAGTAC 
CAACACCAAA 
CAAAATCCGC 
AGTACGCAAA 



GCTATTTGTT 
AAAGCGACAG 
GATCAAGCCA 
GGGTACGCTC 
AAGGCGGCGA 
ACATTGGACG 
TTATTCATCT 
AGCGCGGCGG 
ACCGAAGTCT 
TTCCAAATCC 
AATCCGAATA 



TTAATAGCAT 
CAGGCAGCCT 
ACCAAAGCAC 
AATATTTCCG 
ATCCGTGAGG 
GCGGCAAAGG 
GCAGGCAGCA 
CGATGTCGCC 
ATACCATCAG 
GGCAGGGTCA 
A 



TTTTTTCGGC 
ATTCAGATTG 
CACATTCAGC 
CCGCCCGCGT 
GCGGAAGGTT 
CACGGTGCGC 
CCGTAGTCTT 
GAAGGTGCGG 
CGGCAGCACA 
GCGTCGTTAT 



This corresponds to the amino acid sequence <SEQ ID 750; ORF 214-La>: 

a214-l .pep 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
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51 GNVVIRQGTL NISAARVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 
101 GQANNVAYSS AGSTVVLTGN AKVQRGGDVA EGAVITYNTK TEVYTISGST 
151 KSGAKSASKS GRVSVVIQPS STQKSE* 

a214-l/m214-l 100.0% identity in 176 aa overlap 

10 20 30 40 50 60 

a214-l.pep MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 
I I I I I I t I I I I I I I 1 I ! I I I I t I I I I ) I I I 1 I I I I I I I I t I 1 I I ) t I I I I ) I I I I I t I I I 
m214-l MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 

10 20 30 40 50 60 

70 80 90 100 110 120 

a2 1 4 - 1 . pep NI SAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSSAGST VVLTGN 
I ) I II I I I t I I I I 1 I I I I I I I I I I I 1 I I I i I I I I I I I I I I I I I I I 1 I I I I 1 I I I I I I 1 I I 
m2 1 4 - 1 NI SAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVAYSSAGSTVVLTGN 

70 80 90 100 110 120 

130 140 150 160 170 

a214-l.pep AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVSVVIQPSSTQKSEX 
I I I I I I I I I I I I I 1 I I I I I II I ! I I I I t I I II I I II I I I I I I I I I I I I I I M I I I 1 I 
Iti214-1 AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVSWIQPSSTQKSEX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 75 1>: 

g215 . seq 

1 atgaaagtaa gatggcggta cggaattgcg ttcccattga tattggcggt 

51 tgccttgggc agcctgtcgg catggttggg ccgtatcagc gaagtcgaaa 

101 tcgaggaagt caggctcaat cccgacgaac ctcaatacac aatggacggc 

151 ttggacggaa ggcggtttga cgaacaggga tacttgaaag aacatttgag 

201 cgcgaaaggt gcgaaacagt ttcccgaaaa cagcgacatc cattttgatt 

251 cgccgcatct cgtgttcttc caagaaggca ggctgttgta cgaagtcggc 

3 01 agcgatgaag ccgtttacca taccgaaaac aaacaggttc tttttaaaaa 

351 caacgttgtg ctgaccaaaa ccgccgacgg caggcggcag gcgggtaaag 

401 tcgaaaccga aaaactgcac gtcgataccg aatctcaata tgcccaaacc 

451 gatacgcctg tcagtttcca atatggcgcg tcgcacggtc aggcgggcgg 

501 tatgacctac aaccacaaaa caggcatgtt gaacttctca tctaaagtga 

551 aagccgcgat ttatgataca aaagatatgt aa 

This corresponds to the amino acid sequence <SEQ ID 752; ORF 215.ng>: 

g215 -pep 

1 MKVRWRYGIA FPLILAVALG SLSA WLGRIS EVEIEEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSAKG AKQFPENSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNW LTKTADGRRQ AGKVETEKLH VDTESQYAQT 

151 DTPVSFQYGA SHGQAGGMTY NHKTGMLNFS SKVKAAIYDT KDM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 753>: 

ra215.seq (partial) 

1 . . AGCCTGTCGG CATGGTTGGG TCGTATCAGC GAAGTCGAGA TTGAAGAAGT 

51 CAGGCTCAAT CCCGACGAAC CGCAATACAC AATGGACAGC TTGGACGGCA 

101 GGCGGTTTGA CGAACAGGGA TACTTGAAAG AACATTTGAG CGCGAAGGGC 

151 GCGAAACAGT TTCCGGAAAG CAGCGACATC CATTTTGATT CGCCGCATCT 

201 CGTGTTCTTC CAAGAAGGCA GGTTGTTGTA CGAAGTCGGC AGCGACGAAG 

251 CCGTTTACCA TACCGAAAAC AAACAGGTTC TTTTTAAAAA CAACGTTGTG 

3 01 CTGACCAAAA CCGCCGACGG CAAACGG CAG GCGGGTAAAG TTGAAGCCGA 

3 51 AAAGCTGCAC GTCGATACCG AATCTCAATA TGCCCAAACC GATACGCCTG 
401 CAGTTTCCA ATATGGTGCA TCGCACGGTC AGGCGGGCGG CATGACTTAC 

4 51 GACCACAwwA CAGGCATGTT GAACTTCTCA TCTAAAGTGA AAGCCACGAT 
501 TTATGATACA AAAGATATGT AA 

This corresponds to the amino acid sequence <SEQ ID 754; ORF 21 5>: 

m215.pep (partial) 

1 . . SLSAWLGRIS EVEIEEVRLN PDEPQYTMDS LDGRRFDEQG YLKEHLSAKG 

51 AKQFPESSDI HFDSPHLVFF QEGRLLYEVG SDEAVYHTEN KQVLFKNNW 

101 LTKTADGKRQ AGKVEAEKLH VDTESQYAQT DTPVSFQYGA SHGQAGGMTY 



WO 99/57280 



PCT/US99/09346 



496 



151 DHXTGMLNFS SKVKATIYDT KDM* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from AT. gonorrhoeae 

ORF 215 shows 96.0% identity over a 173 aa overlap with a predicted ORF (ORF 215.ng) 
from N. gonorrhoeae: 

m215/g215 

m215 .pep 
g215 

50 60 70 80 90 100 

m215 . pep YLKEHLSAKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNVV 

II I II I M M M I M I ! II I M I M II II M 1 1 1 1 1 M M M M 1 1 M I M 11 II ! II ! 

g215 YLKEHLSAKGAKQFPENSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 

70 80 90 100 110 120 



10 20 30 40 

SLSAWLGRISEVEIEEVRLNPDEPQYTMDSLDGRRFDEQG 

) 1 1 } ) i 1 1 1 1 1 1 1 1 1 ) 1 1 1 M 1 1 1 ) i 1 1 ! = J 1 1 1 1 1 1 1 1 ! 

MKVRWRYGIAFPLILAVALGSLSAWLGRISEVEIEEVRLNPDEPQYTMDGLDGRRFDEQG 
10 20 30 40 50 60 



110 120 130 140 150 160 

m215 . pep LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHXTGMLNFS 

MIMIhlMlilhMIIIMIIMIMMMilHMMMIMMhi [mill 

g2 1 5 LTKTADGRRQAGJCVETEKL.HVDTESQYAQTDTPVSFQYGASHGQAGGMTYNHKTGMLNFS 

130 140 150 160 170 180 

170 

m215.pep SKVKATIYDTKDMX 

llllhlMIIII 

g215 SKVKAAIYDTKDM 

190 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 755>: 

a215 . seq 

1 ATGAAAGTAA GATGGCGGTA CGGAATTGCG TTCCCATTGA TATTGGCGGT 

51 TGCCTTGGGC AGCCTGTCGG CATGGTTGGG ACGCATCAGC GAAGTCGAGA 

101 TTGAAGAAGT CAGGCTCAAT CCCGACGAAC CGCAATACAC AATGGACGGA 

151 TTGGATGGCA GGCGGTTTGA CGAACAGGGA TACTTGAAAG AACATTTGAG 

201 TTCGAAGGGC GCGAAACAGT TTCCCGAAAG CAGCGACATT CATTTCGACT 

251 CACCGCATCT CGTGTTCTTC CAAGAAGGCA GGTTGTTGTA CGAAGTCGGC 

301 AGCGATGAAG CCGTTTACCA TACCGAAAAC AAACAGGTTC TTTTTAAAAA 

351 CAACGTTGTG CTGACCAAAA CCGCCGACGG CAAACGGCAG GCGGGTAAAG 

401 TTGAAGCCGA AAAGCTGCAC GTCGATACCG AATCTCAATA TGCCCAAACC 

451 GATACGCCTG TCAGTTTCCA ATATGGTGCA TCGCACGGTC AGGCGGGCGG 

501 CATGACTTAC GACCACAAAA CAGGCATGTT GAACTTCTCA TCTAAAGTGA 

551 AAGCCACGAT TT AT GAT AC A AAAGATATGT AA 



This corresponds to the amino acid sequence <SEQ ID 756; ORF 215.a>: 

a215 .pep 

1 MKVRWRYGIA FPLILAVALG SLS AWLGRIS EVEIEEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSSKG AKQFPESSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNVV LTKTADGKRQ AGKVEAEKLH VDTESQYAQT 

151 DTPVSFQYGA SHGQAGGMTY DHKTGMLNFS SKVKATIYDT KDM* 

m2 1 5/a2 15 98.3% identity in 1 73 aa overlap 

10 20 30 40 

m215.pep SLSAWLGRISEVEIEEVRLNPDEPQYTMDS LDGRRFDEQG 

M I I I I I M I I I I I I I I M M M I I II t I : I I I I 1 I I I I I 
a215 MKVRWRYGIAFPLILAVALGSLSAWLGRISEVEIEEVRLNPDEPQYTMDGLDGRRFDEQG 

10 20 30 40 50 60 

50 60 70 80 90 100 

m215 . pep YLKEHLSAKGAKQFPES5DIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNW 
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M | I M I : I I I I I M I M I I M I I M 1 i I I I I I I I 1 II I I II I I I I I I I I ) I I I I 

a2 15 YLKEHLSSKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKNNVV 

70 80 . 90 100 110 120 

110 120 130 140 150 160 

m215 . pep LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHXTGMLNFS 

I I I I I I I I I I I M I I II I I I I M I M I 1 I I I I II I I I I It I I I I I I J I I I II I I 

a 215 LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHKTGMLNFS 

130 140 150 160 170 180 

170 

m2 1 5 . pep SKVKAT I YDTKDMX 

I M I I I I II I I I 1 I 
a215 SKVKAT I YDTKDMX 

190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 757>: 

g216.seq (partial) 

1 . .atgatatcga tttcgagctc ggtacccagc gacgaaatca ccgccatcat 

51 ccccgcactc aaacgcaaag acattaccct cgtctgcatc accgcccgcc 

101 ccgattcaac catggcgcgc catgccgata tccacatcac cgcatcggtt 

151 tcgcaagaag cctgcccgtt ggggcttgcc ccgaccacca gcaccaccgc 

201 cgttatggct ttgggcgacg cgttggcggt cgtcctgctg cgcgcccgcg 

251 cgttcacgcc cgacgacttc gccttgatcc accctgccgg cagcctcggc 

3 01 aaacgcctgc ttttgcgcgt tgccgacatt atgcacaaag gcggcggcct 

3 51 gcccgccgtc cgactcggca cgcccttgaa aggagccatc gtcagcatga 

401 gcgagaaagg tttgggcatg tgggcgggaa cggacgggca aaggctgtct 

451 gaaaggcctt tttactga 

This corresponds to the amino acid sequence <SEQ ID 758; ORF 216.ng>: 

g216.pep (partial) 

1 . .MISISSSVPS DEITAIIPAL KRKDITLVCI TARPDSTMAR HADIHITASV 

51 SQEACPLGLA PTT STTAVMA LGDALAWLL RARAFTPDDF AL I HP AGS LG 

101 KRLLLRVAD I MHKGGGLPAV RLGTPLKGAI VSMSEKGLGM WAGTDGQRLS 

151 ERPFY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 759>: 

m216 . seq 

1 ATGGCAATGG CAGAAAACGG AAAATATCTC GACTGGGCAC GCGAAGTGTT 

51 GCACGCCGAA GCGGAAGGCT TGCGCGAAAT TGCAGCGGAA TTGsACAAAA 

101 ACTTCGTCCT TGCGGCAGAC GCGTTGTTGC ACTGCAAGGG CAGGGTCGTT 

151 AT CACGGGC A TGGTCAAGTC GGGACATATC GGGCGCAAAA TGGCGGCAAC 

2 01 TATGGCCTCG ACCGGCACGC CTGCGTTTTT CGTCCACCCT GCGGAAGCGG 
251 CACACGgCGA TTTGGGTATG ATTGTGGACA rCGACGTGGT CGTCGCGATT 
301 TCCAATTCCG GCGAAAGCGA CGAAATCGCC GCCATCATCC CCGCACTCAA 

3 51 ACGCAAAGAC ATCACGCTTG TCTGCATCAC CGCCCGCCCC GATTCAACCA 

4 01 TGGCGCGCCA TGCCGACATC CACATCACGG CGTCGGTTTC CAAAGAAGCC 
4 51 TGCCCGCTGG GGCTTGCCCC GACCACCAGC ACCACCGCCG TCATGGCTTT 
501 GGGCGATGCG TTGGCGGTCG TCCtGCTGCG CgcACGCGCG TTCACGCCCG 
551 ACGATTTCGC CTTGAGCCAT CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 
601 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 

6 51 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

7 01 TGGGCATGTT GGCGGTAACG GACGGGCAAG GCCGTCTGAA AGGCGTATTC 
7 51 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 
801 TCTTT CGAT A GACGAAGTCA TG CAT ACG C A TCCTAAAACC ATCTCCGCCG 
851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 
901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG CGCTGAATAT 
951 GCACGACCTG CTGGCGGCAC GGATTGTATA G 

This corresponds to the amino acid sequence <SEQ ID 760; ORF 21 6>: 

m2l6 .pep 

1 MAMAENGKYL DWAREVLHAE AEGLREIAAE LXKNFVLAAD ALLHCKGRW 
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51 I TGMVKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDXDVWAI 

101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVSKEA 

151 CPLGIAPT TS TTAVMALGDA LAWLLRARA FTPDDFALSH PAGSLGKRLL 

201 LRVADIMHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 

251 TDGDLRRLFQ ECDNFTGLSI DEVMHTHPKT ISAERLATEA LKVMQANHVN 

301 GLLVTDADGV LIGALNMHDL LAARIV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A^. gonorrhoeae 

ORF 216 shows 91.8% identity over a 147 aa overlap with a predicted ORF (ORF 216.ng) 
from N. gonorrhoeae: 

m216/g216 



70 80 90 100 110 120 

m216 . pep TMASTGTPAFFVHPAEAAHGDLGMIVDXDVWAISNSGESDEIAAIIPALKRKDITLVCI 

:::|hl I I I h I I I I I M I I I I I 1 I I I 
g216 MISISSSVPSDEITAIIPALKRKDITLVCI 

10 20 30 



130 140 150 160 170 180 

m2 1 6 . pep TARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAWLLRARAFTPDDF 

M ! 1 1 1 1 1 1 M II 1 1 i 1 1 1 1 1 = 1 1 M I M 1 i f I ! 1 1 M It 1 1 1 i I M 1 1 1 1 1 1 1 1 i 1 1 1 1 

g2 1 6 TARPDSTMARHADIHI TASVSQ EACPLGLAPTTSTTAVMALGDALAWLLRARAFTPDDF 

40 50 60 70 80 90 



190 200 210 220 230 240 

m2 1 6 . pep ALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAI VSMSEKGLGMLAVTDGQGRL 

II I MM II III I i 1 1 !M 1 1 II Ml 1 1 II Ml I II 1 1 i I I I 1 1 1 1 1 i I 1 1 II 

g2 1 6 ALIHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKGAI VSMSEKGLGMWAGTDGQRLS 

100 110 120 130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 76 1>: 

a216.seq 

1 ATGGCGATGG CAGGAAACGA AAAATATCTT GATTGGGCAC GCGAAGTGTT 

51 GCACACCGAA GCGGAAGGCT TGCGCGAAAT TGCGGCGGAT TTGGACGAAA 

101 ACTTCGCCCT TGCGGCGGAC GCGTTGTTGC ACTGCAAAGG CAGGGTCGTT 

151 ATCACGGGCA TGGGCAAGTC GGGACATATC GGGCGCAAAA TGGCGGCAAC 

201 CATGGCCTCG ACCGGCACGC CCGCGTTTTT CGTCCACCCT GCGGAAGCGG 

251 CACACGGCGA TTTGGGCATG ATTGTGGACA ACGACGTGGT CGTCGCGATT 

301 TCCAATTCCG GTGAAAGCGA CGAAATCGCC GCCATCATCC CCGCGCTCAA 

351 ACGCAAAGAT ATCACGCTTG TCTGCATCAC CGCCCGCCCC GATTCAACCA 

4 01 TGGCGCGCCA TGCCGACATC CACATCACGG CGTCGGTTTC CAAAGAAGCC 

4 51 TGCCCGCTGG GGCTTGCCCC GACCACCAGC ACCACCGCCG TTATGGCTTT 

501 GGGCGATGCG TTGGCGGTTG TCCTGCTGCG CGCCCGCGCG TTCACGCCCG 

551 ACGACTTCGC CTTGAGCCAC CCTGCCGGCA GCCTCGGCAA ACGCCTACTT 

601 TTGCGCGTTG CCGACATTAT GCACAAAGGC GGCGGCCTGC CTGCCGTCCG 

651 ACTCGGCACG CCCTTGAAAG AAGCCATCGT CAGCATGAGT GAAAAAGGGC 

7 01 TGGGCATGTT GGCGGTAACG GACGGGCAAG GCCGTCTGAA AGGCGTATTC 

751 ACCGACGGCG ATTTGCGCCG CCTGTTTCAA GAATGCGACA ATTTTACCGG 

801 TCTTTCGATA GACGAAGTCA TGCATACGCA TCCTAAAACC ATCTCCGCCG 

851 AACGTCTCGC CACCGAAGCC CTGAAAGTCA TGCAGGCAAA CCATGTGAAC 

901 GGGCTTCTGG TTACCGATGC AGATGGCGTG CTGATCGGCG CGCTGAATAT 

951 GCACGACCTT TTGGCGGCGC GGATTGTATA G 



This corresponds to the amino acid sequence <SEQ ID 762; ORF 216.a>: 

a216.pep 

1 MAMAGNEKYL DWAREVLHTE AEGLREIAAD LDENFALAAD ALLHCKGRW 

51 ITGMGKSGHI GRKMAATMAS TGTPAFFVHP AEAAHGDLGM IVDNDWVAI 

101 SNSGESDEIA AIIPALKRKD ITLVCITARP DSTMARHADI HITASVSKEA 

151 CPLGLAPTT S TTAVMALGDA LAWLL RARA FTPDDFALSH PAGSLGKRLL 

201 LRVADIMHKG GGLPAVRLGT PLKEAIVSMS EKGLGMLAVT DGQGRLKGVF 

251 TDGDLRRLFQ ECDNFTGLSI DEVMHTHPKT ISAERLATEA LKVMQANHVN 
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301 GLLVTDADGV LIGALNMHDL LAARIV+ 

ml 1 6/a2 1 6 97.2% identity in 326 aa overlap 

10 20 30 40 50 60 

m21 6 . pep MAMAENGKYLDWAREVLHAEAEGLREIAAELXKNFVLAADALLHCKGRVVITGMVKSGHI 
I I 11 1 I I M M I I I M : I I M I I I I I I : I : M : I ! I I 1 I I I I I I I I I I I I ! Nil) 
a216 MAMAGNEKYLDWAREVLHTEAEGLREIAADLDENFALAADALLHCKGRWITGMGKSGHI 

10 20 30 40 50 60 

70 80 90 100 110 120 

m21 6 . pep GRKMAATMASTGTPAFFVHPMAAHGDLGMIVDXDWVAISNSGESDEIAAIIPALKRKD 
1 I I 11 I I I I II 1 I t 1 I I I I II t 1 I I I I I N I I I I II M I I I I I I I I M 1 I II I I I I I I I 
a2l6 GRKMAATMASTGTPAFFVHPAEAAHGDLGMIVDNDWVAISNSGESDEIAAIIPALKRKD 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2l6.pep ITLVCITARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAWLLRARA 
I I I I I I 1 I N I I I II I I II I I I II N I I I I I I I I N 1 I II II I I I II I I I I N I I II 1 M 
a2l6 ITLVCITARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAWLLRARA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m216 . pep FTPDDFALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAIVSMSEKGLGMLAVT 
I I 11 M I I I I I I I I I I I I I I N I I I I 1 I 11 II I N I I I I I 1 N I I I I I I I I 1 I I I I I I I I 
a216 FT P D D FAL S H PAG S LG KRL LLR VA D I MHKGG G L PA VR LG T P LKE A I VSM S EKG LGML A VT 

190 200 210 220 230 240 

250 260 270 280 290 300 

m216.pep DGQGRLKGVFTDGDLRRLFQECDNFTGLS I DEVMHTHPKT I S AERLATEALKVMQANHVN 

N I I I N i I I I I I II N N I II II M I II M N I II II I II I M M I I I I I I M II II I I 
a216 DGQGRLKGVFTDGDLRRLFQECDNFTGLS IDEVMHTHPKTISAERLATEALKVMQANHVN 

250 260 270 280 290 300 

310 320 
m21 6 . pep GLLVTDADGVLIGALNMHDLLAARIVX 
N I I I 1 I I I I N I N II I II N II N I 
a2 1 6 GLLVTDADGVLIGALNMHDLLAARIVX 

310 320 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 763>: 

g2l7 . seq 

1 atggcggatg acggtttgtt gcggcaactg tccgaaaaac ccagccaaag 

51 tgctctcttc ctgccatttg acccattcgt tttcgaggtt ttggactgcc 

101 ttttggtcat cgggcccggc ttgaaacaat gtttcaagca aatcccggca 

151 acgcgccacc cattcgccga ccgtcgcagg ttgccgccat atccgggcaa 

201 tatccgacag ggtttcgagg aaggcggcaa aacgtccgaa catggcggtt 

251 tgattcacgt cggcatacca cgcgctgaca tcctgccaca tcgggttgcc 

301 gccttcgggc agcatccagc ccaatatcat acggtctgcc gcctgcttcc 

3 51 aggtaaacag ctgatccgtg ccgccgcgca tttctccgtc caatccccaa 
401 tggacgttca aatcggcaac catatcgtgc aaaagcggca aatcgtcccc 

4 51 ggtcagtccg aaacggcgca acacgggcgc ggtttccaaa agcgcgagca 
501 ctttgccgac ttcaaaacgg ctttccagca agtcggacac gcactccaac 
551 gcataaaaaa acggttgccg gcggctgatt ttcacgtccg aaacggaata 
601 cggcaatgcc tgcgcgccgg gttgcgcctg tccgaacacg gcttccataa 
651 aaggcgtata gggttcgata ttcggggtta a 

This corresponds to the amino acid sequence <SEQ ID 764; ORF 217.ng>: 

g217 .pep. . 

1 MADDGLLRQL SEKPSQSALF LPFDPFVFEV LDCLLVI GPG LKQCFKQIPA 

51 TRHPFADRRR LPPYPGNIRQ GFEEGGKTSE HGGLIHVGIP RADILPHRVA 

101 AFGQHPAQYH TVCRLLPGKQ LIRAAAHFSV QSPMDVQIGN HIVQKRQIVP 

151 GQSETAQHGR GFQKREHFAD FKTAFQQVGH ALQRIKKRLP AADFHVRNGI 
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201 RQCLRAGLRL SEHGFHKRRI GFDIRG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 765>: 

m217 . seq 

1 ATGGCGGATG ACGGTGTGCG GCGGCAACTG TCCGGAAAAT TGCGCCAATT 

51 CGGTTTCCGC CTrCCATTTG ACCCATTCGT TTTCAAGGTT TTGGACTGAC 

101 TTTTGGTCAT CGGCTTCAGC TTGGAACAAT GTTTCAAGCA AATCCCGGCA 

151 ACGCGCCACC CATTCGCCGA CCGTTGCGGG CTGCCGCCAT ATCCGTACAA 

201 TATCCGTCAG GGTTTCGAGG AAGGCGGCAA AACGTCCGAA CATGGCGGTT 

251 TGATTCACGT CGGCATACCA CGCGCTGACA TCCTGCCACA TCGGATTGCC 

301 GCCTTTGGGC AGCATCCAGC C CAAT AT CAT GCGTTCTACC GCCTGCTTCC 

351 AGGTGAACAG CTGATCCGTG CCGCCGCGCA TTTCTCCGTC CAAACCCCAG 

401 TGGACGTTCA AATCGGCAAC CATGTCGTGC AAAAGCGGTA AATCGTCCTC 

4 51 AGTCAGTCCG AAACGGCGCA ACACGGGCGC GGTTT CT AAA AGCACAAGCA 

501 CTTTATCGAC TTCAAATCGG CTTTCCAACA AGTCGAACAG GCATGACAAA 

551 GCATGAAACA GCGGTTGGCG GCGGCTGATT TTCACGTCTG ACACGGAATA 

601 CGGCAATGCC TGCGCACCgG GctGCGCCTG TCCGAACACG GCTTCGATAA 

651 AAGGCGTATA GGATTCGATA TTCGGGGTTA A 

This corresponds to the amino acid sequence <SEQ ID 766; ORF 217>: 

m217 .pep 

1 MADDGVRRQL SGKLRQFGFR LPFDPFVFKV LDXLLVI GFS LEQCFKQIPA 

51 TRHPFADRCG LPPYPYNIRQ GFEEGGKTSE HGGLIHVGIP RADILPHRIA 

101 AFGQHPAQYH AFYRLLPGEQ LIRAAAHFSV QTPVDVQIGN HWQKRXIVL 

151 SQSETAQHGR GFXKHKHFID FKSAFQQVEQ AXQSMKQRLA AADFHVXHGI 

201 RQCLRTGLRL SEHGFDKRRI GFDIRG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 217 shows 80.5% identity over a 226 aa overlap with a predicted ORF (ORF 217.ng) 
from N. gonorrhoeae: 

m217/g217 

10 20 30 40 50 60 

MADDGVRRQLSGKLRQFGFRLPFDPFVFKVLDXLLVIGFSLEQCFKQIPATRHPFADRCG 

INN: III! I I lllllllhlll Mill M M M M M I M I I M I I 

MADDGLLRQLSEKPSQSALFLPFDPFVFEVLDCLLVIGPGLKQCFKQIPATRHPFADRRR 

10 20 30 40 50 60 

70 80 90 100 110 120 

LPPYPYNIRQGFEEGGKTSEHGGLIHVGIPRADILPHRIAAFGQHPAQYHAFYRLLPGEQ 

Mill M M II I M M I M 1 1 M M M I M 1 1 M M I M I II I M I II h II M I M 

LPPYPGNIRQGFEEGGKTSEHGGLIHVGIPRADILPHRVAAFGQHPAQYHTVCRLLPGKQ 
70 80 90 100 110 120 

130 140 150 160 170 180 

LIRAAAHFSVQTPVDVQIGNHWQKRXIVLSQSETAQHGRGFXKHKHFIDFKSAFQQVEQ 

I M M M 1 1 1 h h I M II I h 1 1 M M MINIMUM MM I II h 1 1 II I = 

LIRAAAHFSVQSPMDVQIGNHIVQKRQIVPGQSETAQHGRGFQKREHFADFKTAFQQVGH 
130 140 150 160 170 180 

190 200 210 220 

AX QSMKQRLAAAD FHVXHG I RQ CLRTGLRLS EHG FD KRR I G FD I RGX 

I I MMI MUM M M 1 1 1 1 M M M II M II M II II M 

ALQR I KKRLiP AAD FHVRNG I RQ CLRAGLRLS EHG FH KRR I G FD I RG 
190 200 210 220 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 767>: 

a217 . seq 

1 GTGGCGGATG ACGGTGTGCA GCGGCAACTG TCCGGAAAAT TGCGCCAATT 

51 CGGTTTCCGC CTGCCATTTG ACCCATTCGT TTTCGAGGCT TTGGACTGCC 

101 TTTTGGTCAT CGCCTTCGAC TTGGAACAAT GTTTCAAGCA AATCCCGGCA 

151 ACGCGCCACC CATTCGTCAA CCGTCGCAGG TTGCCGCCAT ATCCGTACAA 



m217 .pep 
g217 

m217 .pep 
g217 

m217 .pep 
9217 

m217 .pep 
g217 



